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© Recombinant thermostable DNA polymerase from archaebacteria. 



@ Recombinant DNA polymerases from 
archaebacteria as well as isolated DNA coding 
for such polymerases are provided. The isolated 
DNA is obtained by use of DNA or antibody 
probes prepared from the DNA encoding 7". 
litoralis DNA polymerase and the T. litoralis DNA 
polymerase respectively. Also provided are 
methods for producing recombinant 
archaebacteria thermostable DNA polymerase 
and methods for enhancing the expression of 
such polymerases by identifying, locating and 
removing introns from within the DNA coding 
for such DNA polymerases. 
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FIELD OF THE INVENTION 

The present invention relates to recombinant DNA polymerases from archaebacterium, to isolated DNA 
coding for said DNA polymerases which hybridizes to DNA probes prepared from the DNA sequence coding 
5 for T. litoralis DNA polymerase, to DNA and antibody probes employed in the isolation of said DNA, as well as 
to related methods for isolating said DNA and methods of identifying, locating and removing intervening nu- 
cleotide sequences within said DNA in order to enhance expression of said DNA polymerases 

BACKGROUND OF THE INVENTION 

10 

DNA polymerases are a family of enzymes involved in DNA repair and replication. Extensive research has 
been conducted on the isolation of DNA polymerases from mesophilic microorganisms such as E. coli. See, 
for example, Bessman, etal., J. Biol. Chem. (1 957) 233: 1 71 -1 77 and Buttin and Kornberg J. Biol. Chem.. (1966) 
241:5419-5427. 

15 Examples of DNA polymerases isolated from E. Coli include E. coli DNA polymerase I, Klenow fragment 

of E. coli DNA polymerase I and T4 DNA polymerase. These enzymes have a variety of uses in recombinant 
DNA technology including, for example, labelling of DNA by nick translation, second-strand cDNA synthesis in 
cDNA cloning, and DNA sequencing. See Maniatis, et al., Molecular Cloning: A Laboratory Manual (1982). 
Recently, U.S. Patent Nos. 4,683,195, 4,683,202 and 4,800,159 disclosed the use of the above enzymes 

20 in a process for amplifying, detecting, and/or cloning nucleic acid sequences. This process, commonly referred 
to as polymerase chain reaction (PCR), involves the use of a polymerase, primers and nucleotide triphosphates 
to amplify existing nucleic acid sequences. 

Some of the DNA polymerases discussed above possess a 3'-5' exonuclease activity which provides a 
proofreading function that gives DNA replication much higher fidelity that it would have if synthetis were the 

25 result of only a one base-pairing selection step. Brutlag, D. and Kornberg, A., J. Biol. Chem., (1972) 247:241- 
248. DNA polymerases with 3'-5' proofreading exonuclease activity have a substantially lower base incorpor- 
ation error rate when compared with a non-proofreading exonuclease-possessing polymerase. Chang, L.M.S., 
J. Biol. Chem., (1977) 252:1873-1880. 

Research has also been conducted on the isolation and purification of DNA polymerases from thermo- 

30 philes, such as Thermus aquaticus. Chien, A., etal. J. Bacteriol.. (1976) 127:1550-1557, discloses the isolation 
and purification of a DNA polymerase with a temperature optimum of 80°C from T. aquaticus YT1 strain. The 
Chien, et at., purification procedure involves a four-step process. These steps involve preparation of crude ex- 
tract, DEAE-Sephadex chromatography, phosphocellulose chromatography, and chromatography on DNA cel- 
lulose. Kaledin, et al., Biokhymiyay (1980) 45:644-651 also discloses the isolation and purification of a DNA 

35 polymerase from cells of 7: aquaticus YT1 strain. The Kaledin, et al. purification procedure involves a six-step 
process. These steps involve isolation of crude extract, ammonium sulfate precipitation, DEAE-cellulose chro- 
matography, fractionation on hydroxyapatite, fractionation on DEAE-cellulose, and chromatography on single- 
strand DNA-cellulose. 

United States Patent No. 4,889,818 discloses a purified thermostable DNA polymerase from T. aquaticus, 

40 Taq polymerase, having a molecular weight of about 86,000 to 90,000 daltons prepared by a process substan- 
tially identical to the process of Kaledin with the addition of the substitution of a phosphocellulose chromatog- 
raphy step in lieu of chromatography on single-strand DNA-cellulose. In addition, European Patent Application 
0258017 discloses Taq polymerase as the preferred enzyme for use in the PCR process discussed above. 
Research has indicated that while the Taq DNA polymerase has a 5'-3' polymerase-de pendent exonu- 

45 ' clease function, the Taq DNA polymerase does not possess a 3*-5' proofreading exonuclease function. Lawyer, 
F.C., et al. J. Biol. Chem., (1989) 264:11, p. 6427-6437. Bernard, A., et al. Cell (1989) 59:219. As a result, Taq 
DNA polymerase is prone to base incorporation errors, making its use in certain applications undesirable. For 
example, attempting to clone an amplified gene is problematic since any one copy of the gene may contain an 
error due to a random misincorporation event. Depending on where in the replication cycle that error occurs 

so (e.g., in an early replication cycle), the entire DNA amplified could contain the erroneously incorporated base, 
thus, giving rise to mutated gene product. Furthermore, research has indicated that Taq DNA polymerase has 
a thermal stability of not more than several minutes at 100°C. 

Accordingly, other DNA polymerases with comparable or improved thermal stability and/or 3' to 5' exonu- 
clease proofreading activity would be desirable for the scientific community. One such enzyme (described in 

55 more detail below), DNA polymerase from Thermococcus litoralis, an archaebacterium that grows at temper- 
atures close to 100°C near submarine thermal vents, has been cloned into E. coli. The production of large 
amounts of this recombinant enzyme protein from this gene is complicated, however, by the presence of two 
introns, one of which must be removed by genetic engineering techniques, and the other which encodes an 
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endo nuclease which is spliced out in E. coli. 

It would be desirable to obtain and produce other highly thermostable DNA polymerases from archaebac- 
terium which have a 3' to 5* proofreading activity and/or comparable or improved thermal stability so as to im- 
prove the DNA polymerase processes described above. 

5 

SUMMARY OF THE INVENTION 

in accordance with the present invention, there is provided methods and products for identifying, isolating 
and cloning DNA which encodes DNA polymerases from archaebacteria. The present invention also relates to 

10 recombinant DNA polymerases from archaebacteria as well as to methods of improving expression of said re- 
combinant DNA polymerases by identifying, locating and removing intervening nucleotide sequences or introns 
which occur within the DNA coding for said polymerases. 

More specifically, in accordance with the present invention, it has been discovered that DNA coding for 
DNA polymerases from archaebacterium have substantial homology both at the DNA and amino acid level. It 

15 has also been discovered that the DNA from archaebacterium coding for such enzymes appear to have one 
of more intervening nucleotides or introns which also share substantially homology at the DNA level. 

Thus, in accordance with the present invention, DNA probes can be constructed from the DNA sequence 
coding for one DNA polymerase from archaebacterium, such as Thermococcus litoralis, and used to identify 
and isolate DNA coding for DNA polymerases from other Archaebacterium such as Pyrococcus. Similarly, an- 

20 tibody probes which are cross-reactive with T, litoralis DNA polymerase can also be used to identify DNA coding 
coding sequences which express such other DNA polymerases. 

Once the DNA coding for the target DNA polymerase has been isolated, it can be used to construct ex- 
pression vectors in order to produce commercial quantitaties of the target DNA polymerase. In this regard, the 
present invention also provides methods of increasing expression levels of the target DNA polymerase by iden- 

25 tifying, locating and removing any intervening nucleotide sequences or introns which occur in the DNA se- 
quence coding for the DNA polymerase. As discussed below, while certain introns are spliced out in E. coli, 
expression of the recombinant DNA polymerase can be enhanced by removal of such intervening nucleotide 
sequences prior to expression in E. coli. 

30 BRIEF DESCRIPTION OF THE DRAWINGS 

is a photograph of the SDS-polyacrylamide gel of example 1. 

is a graph showing the polymerase activity and exonuclease activity of the proteins eluted from 
lane 2 of the gel in Fig. 1 A, 

35 FIG. 2 - is a restriction site map of the Xba fragment containing the gene encoding the T. litoralis DNA 
Polymerase which is entirely contained within the BamHI fragment of bacteriophage NEB 619. 
Figures 3A and 3B are graphs showing the half-life of native and recombinant T. litoralis DNA, 
respectively. 

is a graph showing the response of T. litoralis DNA polymerase and Klenow fragment to the pres- 
ence or absence of deoxynucleotides. 

is a restriction site map showing the organization of the T. litoralis DNA polymerase gene in native 
DNA (BamHI fragment of NEB 619) and in E. coli NEB671 and NEB687. 
is a partial nucleotide sequence of the 14 kb BamHI restriction fragment of bacteriophage 
NEB619 inclusive of the 1 .3 kb, 1 .6 kb and 1 .9 kb Eco Rl fragments and part of the Eco Rt/BamHI 
45 ' fragment. 

FIG. 7 - is a comparison of the amino acids in the DNA polymerase consensus homology region 111 with 

the amino acids of the T. litoralis homology island III. 
FIGS. 8-10 are representations of the vectors pPR969 and pCAS4 and V174-1B1 , respectively. 
FIG. 11 - is a graph illustrating the 7". litoralis DNA polymerase variant constructed in Example VI lacks de- 
50 tectable 3'to 5' exonuclease activity. 

FIG. 12 - is a nucleotide sequence of the primers used in Example III. 

FIG. 13A- is the ethidium bromide stained agarose gel of Pyrococcus sp. DNA cut with EcoR I (lane 3), 
BamH I (lane 4) and Hind 111 (lane 5). Lane 1 is M3NA cut with Hind III as markers and lane 2 is 
pBR322 as a marker. 

55 FIG. 1 3B- is an autoradiography of a Southern hybridization of the same gel in Fig. 1 3A. The ^P-DNA probe 
was prepared from a 1 .3 Kb Eco Rl fragment that encodes the amino terminal portion of the T. 
litoralis DNA polymerase. Note that the BamH I cut Pyrococcus sp. DNA gives a single band of 
about 4-5 Kb with the probe. The fact that the 23 Kd band of Hind III cut X.DNA shows up on the 
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film is due to nonspecific hybridization to the large amount of DNA present in that band. The fact 
that the plasmid pBR322 lights up is due to homologous sequences in the probe. 

FIG. 14 - is a restriction site map of the 4.8 Kb BamH I fragment containing the gene containing the Pyr- 
ococcus sp. DNA polymerase in the pUC19 plasmid of £ coli 2207 (NEB#720). 
5 FIG. 1 5 - illustrates the probes used to analyze the similarity of DNA for other target archaebacteria. 

FIG. 16 - is an autoradigraph of quadruplicate Southern blots decribed in Example XIV illustrating the hy- 
bridization of probes to 7: litorafis and Pyrococcus sp. DNA but not to T. aquaticus DNA. 

FIG. 17 - is a Western blot of crude lysates from T. litoralis (V), Pyrococcus sp. G-l-J (J), Pyrococcus sp. 

G-l-H (H), or purified polymerases from Pyrococcus sp. GB-D (DV), T. aquaticus (T) or E. coli 
10 (E) reacted with affinity purified anti-vent DNA polymerase antibody in Part A or anti-Taq DNA 

polymerase antibody in Part B. M represents the marker proteins. The arrow indicates the position 
of the 7^ fitoraiis and Pyrococcus sp. DNA polymerase proteins. The reactivity in Part B is to back- 
ground proteins and not to the DNA polymerases as seen in Part A. 

FIG. 1 8 - is a partial DNA nucleotide sequence of the gene coding for the Pyrococcus sp. DNA polymerase. 
15 FIG. 19 - is a comparison of the deduced amino acid sequence of Pyrococcus sp. DNA polymerase to T. 
litorafis DNA polymerase. 



DETAILED DESCRIPTION OF THE INVENTION 



20 In accordance with one preferred embodiment of the present invention, there is provided a method of pro- 

ducing recombinant DNA polymerase from archaebacterium. The preferred process comprises 1) forming a 
genomic library from the target archaebacterium, 2) transforming or transfecting an appropriate host cell, 3) 
either i) reacting the DNA from the transformed or transfected host cells with a DNA probe which hybridizes to 
the DNA coding for the DNA polymerase from T. litoralis, or ii) reacting the extract from the transformed or trans- 

25 fected host cells with an antibody probe which is cross-reactive with T. litoralis DNA polymerase, 4) assaying 
the transformed or transfected cells of step 3 which either hybridize to the DNA probe or cross react with the 
T. litoralis specific antibody for thermostable DNA polymerase activity. 

The aforementioned method allows for the production of recombinant DNA polymerases from archaebac- 
terium, as well as for the isolation of DNA coding for said polymerases. 

30 In accordance with another preferred embodiment, there is provided a method for enhancing the expression 
of recombinant DNA polymerases from archaebacterium. As noted above, it is believed that the DNA coding 
for DNA polymerases from archaebacterium may possess one or more introns which may complicate expres- 
sion of the target recombinant DNA polymerase. Location and removal of these introns prior to constructing 
the expression system has been found to enhance expression of the target DNA polymerase, even when the 

35 intron is normally spliced out in its host cell. As discussed in more detail below, the intron can be identified and 
removed in a number of ways. In particular, it has also been found that the introns of T. litoralis share substantial 
homology at the DNA level with other genuses of archaebacteria such as Pyrococcus. Knowledge of this fact 
should facilitate the identification, location and removal of introns by the methods described in more detail be- 
low. 

40 In practicing certain embodiments of the present invention it is preferable to employ either i) DNA probes 

which hybridize to the DNA coding for 7: litoralis DNA polymerase, or ii) antibodies which cross-react with T. 
litoralis DNA polymerase. DNA probes are preferably constructed based on the DNA sequence coding for the 
7: litoralis DNA polymerase (See Fig. 6), while the antibody probes are preferably made from the purified 7: 
litorafis enzyme itself. Following the procedures of the present invention, one could, of course construct probes 

45 ' based on the DNA polymerase or its DNA from other sources of archaebacterium. However, the preferred DNA 
polymerase and DNA used to construct such probes is from T. litoralis. 

Production of Native T. litoralis DNA Polymerase 

so T. litoralis DNA polymerase is obtainable from T. litorafis strain NS-C (DSM No. 5473, a sample of which 

has also been deposited at the American Type Culture Collection on September 17, 1991 under ATCC Acces- 
sion No. 55233). T. litoralis was isolated from a submarine thermal vent near Naples, Italy in 1985. This organ- 
ism, T. litoralis, is an extremely thermophilic, sulfur metabolizing, archaebacteria, with a growth range between 
55°C and 98°C. Neuner, et al., Arch. Microbiol. (1990) 153:205-207. 

55 For recovering the native protein, T. litoralis may be grown using any suitable technique, such as the tech- 

nique described by Belkin, et al., Arch Microbiol. (1985) 142:181-186, the disclosure of which is incorporated 
by reference. Briefly, the cells are grown in the media described above containing 10 mg/ml of sulfur and 0.01 
M cysteine in 15 ml screw cap tubes at95°Cfor2 days. When larger amounts of cells are required, 1 liter screw 
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cap bottles are used and after sterilization are inocculated with a fresh 10 ml culture and grown at 90-95?C for 
2 days. 

After cell growth, one preferred method for isolation and purification of the enzyme is accomplished using 
the multi-step process as follows: 
5 First, the cells, if frozen, are thawed, suspended in a suitable buffer such as buffer A(10 mM KP0 4 buffer, 

pH 7.4; 1.0 mM EDTA, 1.0 mM beta-mercaptoethanol), sonicated and centrifuged. The supernatant is then 
passed through a column which has a high affinity for proteins that bind to nucleic acids such as Affigel blue 
column (Biorad). The nucleic acids present in supernatant solution of T. litoralis and many of the proteins pass 
through the column and are thereby removed by washing the column with several column volumes of low salt 

10 buffer at pH of about 7.0. After washing, the enzyme is eluted with a linear gradient such as 0.1 to 2.0 M NaCI 
buffer A. The peak DNA polymerase activity is dialyzed and applied to a phosphocellulose column. The column 
is washed and the enzyme activity eluted with a linear gradient such as 0.1 to 1.0 M NaCI in buffer A. The peak 
DNA polymerase activity is dialyzed and applied to a DNA cellulose column. The column is washed and DNA 
polymerase activity is eluted with a linear gradient of 0.1 to 1.0 M NaCI in buffer A. The fractions containing 

is DNA polymerase activity are pooled, diaiyzed against buffer A. and applied to a high performance liquid chro- 
matography column (HPLC) mono-Q column (anion exchanger). The enzyme is again eluted with a linear gra- 
dient such as 0.05 to 1 .0 M NaCI in a buffer A. The fractions having thermostable polymerase activity are pooled, 
diluted and applied to HPLC mono-S column (cation exchanger). The enzyme is again eluted with a linear gra- 
dient such as 0.05 to 1.0 M NaCI in buffer A. The enzyme is about 50% pure at this stage. The enzyme may 

20 be further purified by precipitation of a contaminating lower molecular weight protein by repeated dialysis 
against buffer A supplemented with 50 mM NaCI. 

The apparent molecular weight of the DNA polymerase obtainable from T. litoralis is between about 90,000 
to 95,000 daltons when compared with protein standards of known molecular weight, such as phosphorylase 
B assigned a molecular weight of 97,400 daltons. It should be understood, however, that as a protein from an 

25 extreme thermophile, T. litoralis DNA polymerase may electrophorese at an aberrant relative molecular weight 
due to failure to completely denature or other intrinsic properties. The exact molecular weight of the thermo- 
stable enzyme of the present invention may be determined from the coding sequence of the T. litoralis DNA 
polymerase gene. The molecular weight of the eluted product may be determined by any technique, for exam- 
ple, by SDS-polyacrytamide gel electrophoresis (SDS-PAGE) using protein molecular weight markers. 

30 Polymerase activity is preferably measured by the incorporation of radioactively labeled deoxynucleotides 

into DNAse-treated, or activated, DNA; following subsequent separation of the unincorporated deoxynucleo- 
tides from the DNA substrate, polymerase activity is proportional to the amount of radioactivity in the acid- 
insoluble fraction comprising the DNA. Lehman, I.R., et al., J. Biol. Chem. (1958) 233:163, the disclosure of 
which ts incorporated herein by reference. 

35 The half-life of the DNA polymerase of the present invention at 100°C is about 60 minutes. The thermal 

stability or half-life of the DNA polymerase is determined by preincubating the enzyme at the temperature of 
interest in the presence of all assay components (buffer, MgCI 2 , deoxynucleotides, and activated DNA) except 
the single radioactively-labeled deoxynucleotide. At predetermined time intervals, ranging from 4-1 80 minutes, 
small aliquots are removed, and assayed for polymerase activity using the method described above. 

40 The half-life at 100°C of the DNA polymerase can also be determined in the presence of stabilizers such 

as the nonionic detergent octoxynol, commonly known as TRITON X-100 (Rohm & Haas Co.), or the protein 
bovine serum albumin (BSA). The non-ionic detergents polyoxyethytated (20) sorbitan monolaurate (Tween 20, 
ICI Americas Inc.) and ethoxylated alkyl Phenol (nonyl) (ICONOL NP-40, BASF Wyandotte Corp.) can also be 
used. Stabilizers are used to prevent the small amount of enzyme added to the reaction mixture from adhering 

45 to the sides of the tube or from changing its structural conformation in some manner that decreases its enzy- 
matic activity. The half-life at 100°C of the DNA polymerase obtainable from T. litoralis in the presence of the 
stabilizer TRITON X-100 or BSA is about 95 minutes. 

Preparation Of Recombinant 7: litoralis DNA Polymerase 

50 

T. litoralis DNA polymerase may also be produced by recombinant DNA techniques, as the gene encoding 
this enzyme has been cloned from T. litoralis genomic DNA. The complete coding sequence for the 7: litoralis 
DNA polymerase (Figure 6) can be derived from bacteriophage NEB #619 on an approximately 14 kb BamHI 
restriction fragment. This phage was deposited with the American Type Culture Collection (ATCC) on April 24, 
55 1990 and has Accession No. ATCC 40795. 

The production of a recombinant form of 7: litoralis DNA polymerase generally includes the following steps: 
DNA is isolated which encodes the active form of the polymerase, either in its native form or as a fusion with 
other sequences which may or may not be cleaved away from the native form of the polymerase and which 
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may or may not effect polymerase activity. Next, the gene is operably linked to appropriate control sequences 
for expression in either prokaryotic or eukaryotic host/vector systems. The vector preferably encodes all func- 
tions required for transformation and maintenance in a suitable host, and may encode selectable markers 
and/or control sequences for T. litoralis polymerase expression. Active recombinant thermostable polymerase 
5 can be produced by transformed host cultures either continuously or after induction of expression. Active ther- 
mostable polymerase can be recovered either from within host cells or from the culture media if the protein is 
secreted through the cell membrane. 

While each of the above steps can be accomplished in a number of ways, it has been found that for cloning 
the DNA encoding T. litoralis DNA polymerase, expression of the polymerase from its own control sequences 
10 in £ coli results in instability of the polymerase gene, high frequency of mutation in the polymerase gene, slow 
cell growth, and some degree of cell mortality. 

While not wishing to be bound by theory, it is believed that this instability is due at least in part to the pres- 
ence of a 1614 bp intron that splits the 7: litoralis DNA polymerase gene from nucleotides 1776 to 3389 of Fig. 
6, and a second 1170 bp intron that splits the 7: litoralis DNA polymerase gene from nucleotides 3534 to 4703. 
15 As discussed below, intervening sequences are also believed to be present in the DNA coding for DNA poly- 
merases from other archaebacteria. Introns from a number of arch ae bacteria are also believed to share sub- 
stantial homology to the introns present in the DNA for coding for T. litoralis DNA polymerase, which, in accor- 
dance with one aspect of the present invention, will facilitate their identification, location and removal. 

Introns are stretches of intervening DNA which separate coding regions of a gene (the protein coding re- 
20 gions are called exons). Introns can contain nonsense sequences or can code for proteins. In order to make 
a functional protein, the intron must be spliced out of the pre-mRNA before translation of the mature mRNA 
into protein. Introns were originally identified in eukaryotes, but have been recently described in certain pro- 
karyotes. (See, e.g., Krainerand Maniatis {Transcription and Splicing (1988) B.D. Hames and D.M. Glover, eds. 
IRL Press, Oxford and Washington, D.C. pp. 131-206)). When agene with an intron is transcribed into mRNA 
25 the intron may self-splice out to form a mature mRNA or cellular factors may be required to remove the intron 
from the pre-mRNA. Id. Bacterial introns often require genus specific co-factors for splicing. For example, a 
Bacillus intron may not be spliced out in E. coli. (Id). 

However, there is some evidence that suggests that the intervening DNA sequence within the gene coding 
for the T. titorafis DNA polymerase is transcribed and translated, and that the peptide produced therefrom is 
30 spliced out at the protein level, not the mRNA level. Therefore, regardless of where the splicing event occurs, 
in order to express T. litoralis DNA polymerase in £ coli, it is preferable to delete the intervening sequence 
prior to expression of the polymerase in an £ coli system. Of course, the recombinant vector containing the T. 
litoralis DNA polymerase gene could be expressed in systems which possess the appropriate factors for splicing 
the intron, for example, a Tbermococcus system. 
35 - it is also preferable that the T. litoralis thermostable polymerase expression be tightly controlled in £ coli 
during cloning and expression. Vectors useful in practicing the present invention should provide varying de- 
grees of controlled expression of T. litoralis polymerase by providing some or all of the following control fea- 
tures : (1) promoters or sites of initiation of transcription, either directly adjacent to the start of the polymerase 
or as fusion proteins, (2) operators which could be used to turn gene expression on or off, (3) ribosome binding 
40 sites for improved translation, and (4) transcription or translation termination sites for improved stability. Ap- 
propriate vectors used in cloning and expression of 7. litoralis polymerase include, for example, phage and plas- 
mids. Example of phage include Xgtll (Promega), X Dash (Stratagene) X Zapll (Stratagene). Examples of plas- 
mids include pBR322, pBluescript (Stratagene), pSP73 (Promega), pGW7 (ATCC No. 40166), pET3A (Rosen- 
berg, et al., Gene, (1987) 56:125-135), and pET11C (Methods in Enzymology (1990) 185:60-89). 

45 ' 

Transformation and Infection 

Standard protocols exist for transformation, phage infection and cell culture. Maniatis, et al., Molecular 
Cloning : A Laboratory Manual (1982). Of the numerous £ coli strains which can be used for plasmid trans- 

50 formation, the preferred strains include JM101 (ATCC No. 33876), XL1 (Stratagene), and RRI (ATCC No. 
31343), and BL21(DE3) plysS (Method in Enzomology (1990) supra). £ coli strain XL1, ER1578 and ER1458 
(Raleigh, et al., N.A. Research (1988) 16:1563-1575) are among the strains that can be used for lambda phage, 
and Y1089 can be used for lambda gtll lysogeny. When preparing transient iysogens in Y1089 (Arasu, et al., 
Experimental Parasitology (1 987) 64:281-289), a culture is infected with lambda gtll recombinant phage either 

55 by a single large dose of phage or by co-culturing with a lytic host The infected Y1 089 cells are preferably grown 
at 37°C in the presence of the inducer IPTG resulting in buildup of recombinant protein within the lysis-defective 
host/phage system. 
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Construction of Genomic DNA Expression Library and Screening for Thermostable Polymerase 

The most common methods of screening for a gene of choice are (1) by hybridization to homologous genes 
from other organisms, (2) selection of activity by complementation of a host defect, (3) reactivity with specific 

5 antibodies, or (4) screening for enzyme activity. For 7". litoralis, antibody detection is preferred since it initially 
only requires expression of a portion of the enzyme, not the complete active enzyme. The instability of the IT 
litoralis polymerase gene in E. coli would have made success by other methods more difficult. 

7! litoralis DNA can be used to construct genomic libraries as either random fragments or restriction enzyme 
fragments. The latter approach is preferred. Preferably, Eco Rl partials are prepared from T. litoralis genomic 

10 DNA using standard DNA restriction techniques such as described in Maniatis, et al., Molecular Cloning : A lab- 
oratory Manual (1 982), the disclosure of which is incorporated herein by reference. Other restriction enzymes 
such as BamHI, Nrul and Xbal can also be used. 

Although methods are available to screen both plasmids and phage using antibodies (Young and Davis, 
PNAS, (1983) 80:1194-1198), in accordance with the present invention it has been found that phage systems 

15 tend to work better and are therefore preferred for the first libraries. Since it is uncertain whether T. litoralis con- 
trol regions function in E. co//, phage vectors which supply all necessary expression control regions such as 
lambda gt11 and lambda Zap II, are preferred. By cloning T. litoralis DNA into the Eco Rl site of lambda gtil, T. 
litoralis polymerase may be expressed either as a fusion protein with beat-galactosidase or from its own en- 
dogenous promoter. 

20 Once formed, the expression libraries are screened with mouse anti- 7: litoralis DNA polymerase antiserum 

using standard antibody/plaque procedures such as those described by Young and Davis, PNAS (1 983), supra. 

The mouse anti-I litoralis DNA polymerase antiserum used to screen the expression libraries can be pre- 
pared using standard techniques, such as the techniques described in Harlow and Cane, Antibodies: A Labo- 
ratory Manual (1 988) CSH Press, the disclosure of which is incorporated herein by reference. Since most sera 

25 react with E. coli proteins, it is preferable that the 7: litoralis polymerase antisera be preabsorbed by standard 
methods against E. coli proteins to reduce background reactivity when screening expression libraries. Phage 
reacting with anti- T. litoralis polymerase antiserum are picked and plaque purified. Young and Davis, PNAS 
(1983), supra. 

The 7. litoralis DNA polymerase DNA, coding for part or the whole gene, can then be subcloned in, for ex- 
30 ample, pBR322, pBluescript, m13or pUC19. If desired, the DNA sequence can be determined by, for example, 
the Sanger dideoxy chain-terminating method (Sanger, R, Nicklen, S. & Coulson, A.R. PNAS (1977) 74:5463- 
5467). 

Identification of DNA Encoding and Expression of the T. litoralis DNA Polymerase 

35 

Several methods exist for determining that the DNA sequence coding for the T. litoralis DNA polymerase 
has been obtained. These include, for example, comparing the actual or deduced amino-terminal sequence of 
the protein produced by the recombinant DNA to the native protein, or determining whether the recombinant 
DNA produces a protein which binds antibody specific for native 7; litoralis DNA polymerase. In addition, re- 

40 search by Wang, et al., FASEB Journal (1 989) 3:20 suggests that certain regions of DNA polymerase sequenc- 
es are highly conserved among many species. As a result, by comparing the predicted amino acid sequence 
of the cloned gene product with the amino acid sequence of known DNA polymerases, such as human DNA 
polymerase and E. coli phage T4 DNA polymerase, the identification of these islands of homology provides 
strong evidence that the recombinant DNA indeed encodes a DNA polymerase. Once identified, the DNA se- 

45 ' quence coding for the 7: litoralis DNA polymerase, can be cloned into an appropriate expression vector such 
as a plasmid derived from E. co//, for example, pET3A, pBluescript or pUC19, the plasmids derived from the 
Baciilus subtilis such as pUB11 0, pTP5 and pC1 94, plasmids derived from yeast such as pSH1 9 and pSH1 5, 
bacteriophage such as lambda phage, bacteria such as Agrobacterium tumefaciens, animal viruses such as 
retroviruses and insect viruses such as Baculovirus. 

so As noted above, in accordance with the present invention, it has been found that DNA coding for T. litoralis 

DNA polymerase contains two introns: i) an 1614 bp intron or intervening sequence, spanning from nucleotides 
1776 to 3389 in Figure No. 6, and ii) an 1170 bp intron, spanning nucleotides 3534 to 4703 in Figure 6. This 
1170 bp intron codes for an endonuclease and is found to self-splice out in E. coli. Prior to overexpression in 
host cells such as E. coli, it is preferable to delete the DNA sequence coding for both the 1614 and 1170 bp 

55 introns. Even though the 1170 bp intron splices out in E. co//., it has been found that expression vectors which 
do not contain this intron result in increased production of the desired polymerase. 

In general, once an intron has been identified and located within a nucleotide sequence, there are a number 
of approaches known in the art which can be used to delete DNA sequences and therefore splice out an intron 
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in-vitro. One method involves identifying unique restriction enzyme sites in the coding region which are near 
the splice junction or area to be deleted. A duplex oligomer is synthesized to bridge the gap between the two 
restriction fragments. A three-part ligation consisting of the amino end restriction fragment, the bridging oligo 
and the carboxy end restriction fragment yields an intact gene with the intron deleted. 

5 Another method is a modification of the above-described method. The majority of the intron is deleted by 

cutting with restriction enzymes with unique sites within the intron, but close to the coding sequence border. 
The linear plasmid containing a deletion of the majority of the intron is ligated together. Single strand phage 
are generated from the pBluescript vector recombinant by super infection with thefl helper phage IR1. Asingel 
strand oligomer is synthesized with the desired final sequence and is annealed to the partially deleted intron 

10 phage DNA. The remainder of the intron is thus looped out. By producing the original phage in E. coli strain 
CJ236 the Kunkel method of mutagenesis Methods in Enymoiogy 154:367 (1987)) can be used to select for 
the full deleted intron constructs. 

Yet another method which can be used to delete the intron uses DNA amplification. See, for example, Man- 
iatis, et al., Molecular Cloning: A Laboratory Manual, (1989) Vol. 2, 2nd edition, the disclosure of which is herein 

15 incorporated by reference. Briefly, primers are generated to amplify and subsequently join the amino and car- 
boxyl halves of the gene. 

When an intron is deleted in-vitro, using the methods discussed above, the native splice junction may be 
unknown. Accordingly, one skilled in the art would predict that several possible artificial splice junctions exist 
that would result in the production of an active enzyme. 

20 Once the intron is deleted, overexpression of the T. litoratis DNA polymerase can be achieved, for example, 

by separating the 7: titoralis DNA polymerase gene from its endogenous control elements and then operably 
linking the polymerase gene to a very tightly controlled promoter such as a T7 expression vector. See, Rosen- 
berg, et al., Gene (1987) 56:125-135, which is hereby incorporated by reference, insertion of the strong pro- 
moter may be accomplished by identifying convenient restriction targets near both ends of the 7: litoralis DNA 

25 polymerase gene and compatible restriction targets on the vector near the promoter, or generating restriction 
targets using site directed mutagenesis (Kunkel (1984) supra), and transferring the 7: litoralis DNA polymerase 
gene into the vector in such an orientation as to be under transcriptional and translations control of the strong 
promoter. 

T. titoralis DNA polymerase may also be overexpressed by utilizing a strong ribosome binding site placed 
30 upstream of the T. titoralis DNA polymerase gene to increase expression of the gene. See, Shine and Dalgamo, 
Proc. Natl. Acad. Sci. USA (1974) 71:1342-1346, which is hereby incorporated by reference. 

The recombinant vector is introduced into the appropriate host using standard techniques for transforma- 
tion and phage infection. For example, the calcium chloride method, as described by Cohen, S.N., PNAS (1972) 
69:2110 is used for E. coli, the disclosure of which is incorporated by reference. The transformation of Bacillus 
35 is carried out according to the method of Chang, S„ et al., Molecular and General Genetics (1979) 168:111, 
the disclosure of which is incorporated by reference. Transformation of yeast is carried out according to the 
method of Parent, et al., Yeasf (1985) 1:83-138, the disclosure of which is incorporated by reference. Certain 
plant cells can be transformed with Agrobacterium tumefaciens, according to the method described by Shaw, 
C.H., et al., Gene (1983) 23:315, the disclosure of which is incorporated by reference. Transformation of animal 
40 cells is carried out according to, for example, the method described in Virology (1973) 52:456, the disclosure 
of which is incorporated by reference. Transformation of insect cells with Baculovirus is carried out according 
to, for example, the method described in Biotechnology (1988) 6:47, the disclosure of which is incorporated 
herein by reference. 

The transformants are cultivated, depending on the host cell used, using standard techniques appropriate 
45 * to such cells. For example, for cultivating E. coli, cells are grown in LB media (Maniatis, supra) at 30°C to 42°C 
to mid log or stationary phase. 

The T. litoralis DNA polymerase can be isolated and purified from a culture of transformed host cells, for 
example, by either extraction from cultured cells or the culture solution. 

When the 7: litoralis DNA polymerase is to be extracted from a cultured cell, the cells are collected after 
so cultivation by methods known in the art, for example, centrifugation. Then, the collected cells are suspended 
in an appropriate buffer solution and disrupted by ultrasonic treatment, lysozyme and/or freeze-th awing. A crude 
extract containing the T. litoralis DNA polymerase is obtained by centrifugation and/or filtration. 

When the T. litoralis DNA polymerase is secreted into the culture solution, i.e., alone or as a fusion protein 
with a secreted protein such as maltose binding protein, the supernatant is separated from the cells by methods 
55 known in the art 

The separation and purification of the T. litoralis DNA polymerase contained in the culture supernatant or 
the cell extract can be performed by the method described above, or by appropriate combinations of known 
separating and purifying methods. These methods include, for example, methods utilizing solubility such as 
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salt precipitation and solvent precipitation, methods utilizing the difference in molecular weight such as dialysis, 
ultra-filtration, gel-filtration, and SDS-polyacryl amide gel electrophoresis, methods utilizing a difference in elec- 
tric charge such as ion-exchange column chromatography, methods utilizing specific affinity such as affinity 
chromatography, methods utilizing a difference in hydrophobicity such as reverse-phase high performance liq- 
5 uid chromatography and methods utilizing a difference in isoelectric point such as isoelectric focusing electro- 
phoresis. 

One preferred method for isolating and purification of the recombinant enzyme is accomplished using the 
multi-stage process as follows. 

First, the cells, if frozen are thawed, suspended in a suitable buffer such as Buffer A (100 mM NaCI, 25 

10 mM Tris pH 7.5, 0.1 mM EDTA t 10% glycerol, 0.05% Triton X-100), lysed and centrifuged. The clarified crude 
extract is then heated to 75°C for approximately 30 minutes. The denatured proteins are removed by centrifu- 
gation. The supernatant is then passed through a column that has high affinity for proteins that bind to nucleic 
acids such as Affigel Blue column (Biorad). The nucleic acids present in the supernatant solution and many of 
proteins pass through the column and are thereby removed by washing the column with several column vol- 

15 umes with low-salt buffer at pH of about 7.0. After washing, the enzyme is eluted with a linear gradient such 
as 0.1 M to 1.5 M NaCI Buffer A. The active fractions are pooled, dialyzed and applied to a phosphocellulose 
column. The column is washed and DNA polymerase activity eluted with a linear gradient of 0.1 to 1 .0 M NaCI 
in Buffer B (100 M NaCI, 15 mM KP0 4 , 0.1 mM EDTA, 10% glycerol, 0.05% Triton X-100, pH 6.8). The fractions 
are collected and BSA is added to each fraction. The fractions with DNA polymerase activity are pooled. The 

20 T. litoralis DNA polymerase obtained may be further purified using the standard product purification techniques 
discussed above. 

Stabilization and Use of the T. litoralis DNA Polymerase 

25 For long-term storage, the thermostable enzyme of the present invention is stored in the following buffer 

0.05 M NaCI, 0.01 M KPO + (pH 7.4), 0.1 mM EDTA and 50% glycerol at -20°C. 

The T. litoralis DNA polymerase of the present invention may be used for any purpose in which such an 
enzyme is necessary or desirable. For example, in recombinant DNA technology including, second-strand 
cDNA synthesis in cDNA cloning, and DNA sequencing. See Maniatis, et al., supra. 

30 The T. litoralis DNA polymerase of the present invention may be modified chemically or genetically to in- 

activate the 3'-5' exonuclease function and used for any purpose in which such a modified enzyme is desirable, 
e.g., DNA sequencing. 

For example, genetically modified T. litoralis DNA polymerase may be isolated by randomly mutagenizing 
the T. litoralis DNA polymerase gene and then screening for those mutants that have lost exonuclease activity, 
35 without loss of polymerase activity. Alternatively, genetically modified T. litoralis DNA polymerase is preferably 
isolated using the site-directed mutagenesis technique described in Kunkel, T.A., PNAS (1985) 82:488-492, 
the disclosure of which is herein incorporated by reference. 

In addition, the 7: litoralis DNA polymerase of the present invention may also be used to amplify DNA, e.g., 
by the procedure disclosed in U.S. Patent Nos. 4,683,195, 4,683,202 and 4,800,159. 

40 

Construction of Genomic DNA Library and Screening for Thermostable Polymerase from Archaebacteria 
other than T. litoralis 

In accordance with the present invention, cross hybridization of a target Archae bacterium genomic DNA 
45 ' library using an DNA probe prepared from the DNA polymerase gene of T. litoralis and/or cross-reactivity with 
mouse anti-T. litoralis antiserum allows for the identification and isolation of the DNA polymerase genes from 
other archaebacterium, such as Methanococcus, Methanobacter, Methanomicrobium, Halobacter, Thermo- 
plasma, Thermococcus, Pyrococcus, and the like (see, e.g. Woese, C, Microbiological Reviews, pp. 221-270, 
June 1987, the disclosure of which is hereby incorporated by reference). 
so in general, DNA from other archaebacterium can be isolated using the method described above. As with 

T. litoralis The archaebacterium DNA once isolated can be used to construct genomic libraries as either random 
fragments or restriction enzyme fragments. The latter approach is preferred. This approach generally entails 
cutting the target genomic DNA with various restriction enzymes and probing the fragments so formed with, for 
example, a T. litoralis DNA probe. Alibrary is thereafter formed from one or more of the enzymes which produce 
55 a single hybridization band and which are about 4Kb or large enough to at least code for the molecular weight 
of the target DNA polymerase. 

Although methods are available to screen both plasmids and phage using antibodies or DNA probes (Young 
and Davis, PNAS (1983) 80:1194-1198; Maniatis et al, supra) in accordance with the present invention it has 
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been found the phage systems tend to work better and are therefore preferred for the first libraries. 

Genomic libraries can be screened using the colony or plaque hybridization procedure (Maniatis, et al. su- 
pra) or using the antibody plaque DNA procedures. In the colony or plaque hybridization procedure, DNA probes 
may be formed by labelling a polymerase gene from a related organism, for example, T. litoraiis. The genomic 

5 library is hybridized with labeled probe under conditions which depend on the stringency desired, which may 
be experimentally determined in each case as described below. 

Specifically, although each arch ae bacterium will require its own set of hybridization conditions, in order to 
maximize the detectability of the target DNA, several basic approaches can be followed. Optimum hybridization 
conditions and probes can be determined for each target archaebacterium, for example, by performing test 

10 Southern blots at various temperatures. Hybridization is typically carried out in 4X SET, 0.1 M sodium phos- 
phate, pH 7.0, 0.1% Na pyrophosphate, 0.1% SDS, 1X Denhardts solution (Maniatis, supra). Probe selection 
can also vary with respect to size and regions of the 7! litoraiis DNA polymerase gene (Fig. 6). Optimum probes 
can be determined for a target archaebacterium by performing test Southern blots as described above with large 
or small DNA fragments, or even oligomers. One could, for example, select probes that are totally within one 

15 of the intervening sequences of T. litoraiis to screen for intervening sequences in the target archaebacterium's 
DNA polymerase gene, or such probes could be limited to mature polymerase coding regions. 

In general, the DNA probe could be the entire sequence of Figure 6, or a portion thereof. The DNA probe 
should be at least 20 nucleotides in length, preferably at least about 50 nucleotides in length, most preferably 
at least about 150 nucleotides in length. Three such DNA probes which may be used are the 1.3 kb fragment 

20 (nucleotides 1 to 1274 of Figure 6), the 1 .6 kb fragment (nucleotides 1269 to 2856 of Figure 6), and the 1 .9 kb 
fragment (nucleotides 2851 to 4771 of Figure 6). 

As with T. litoraiis, the DNA coding for the target archaebacterium DNA polymerase may also be obtained 
using an antibody/plaque procedure. When genomic expression libraries are screened using the antibody/pla- 
que procedure, since it is uncertain whether the target archaebacterium's control regions will function in E coli, 

25 phage vectors which supply all necessary expression control regions such as Xgt11 and XZap II are preferred 
for antibody screening. By cloning archaebacterium DNA into an appropriate site such as the EcoR I site of 
Xgt11, the archaebacterium's DNA polymerase may be expressed either as a fusion protein with beta- 
galactosidase in Xgt11 and XZapll or from its own endogenous promoter. 

Once formed, the expression libraries can be screened either with anti-archaebacterium DNA polymerase 

30 antiserum from the target archaebacterium or, by antibody against the DNA polymerase of a closely related 
organism (i.e. T. litoraiis, another extreme thermophile) using standard antibody/plaque procedures such as 
those described by Young and David, P/vV4S(1983), supra. 

Using either procedure, the archaebacterium DNA polymerase DNA, coding for part or the whole gene, 
once identified can then be subcloned in, for example, pBR322, pBluescript, M13 or pUC19. If desired, the DNA 

35 sequence can be determined by, for example, the Sanger dideoxy chain-terminating method (Sanger, F., Nick- 
len, S. & Coulson, A.R. PNAS (1977) 74:5463-5467). 

Identification of the DNA Encoding the DNA Polymerase 

40 Once the genomic DNA expression library has been constructed and the target DNA coding for the arch- 

aebacterium DNA has been identified by use of DNA probes or antibody cross- reactivity from 7; litoraiis, one 
may confirm that a DNA polymerase sequence has been obtained as described above for T. litoraiis. The re- 
sulting clone may be sequenced by standard methods such as by Sanger dedioxy sequencing. 

45 ' Identification, Location and Removal of Intervening Sequencing and Overexpression of the DNA Polymerase 

In accordance with another aspect of the present invention, it has been found that the DNA coding for DNA 
polymerases from other archaebacterium also contain one or more intervening nucleotide sequences or introns. 
Moreover, it has been found that not only do such introns share substantial homology with the introns found in 

50 T. litoraiis, they appear to be located in the same positions. More specifically, in accordance with the present 
invention, introns have been identified in the Pol a conserved region motifs in both T. litoraiis and Pyrococcus 
sp. DNA polymerase genes. Without wishing to be bound by theory, it is believed that other archaebacteria also 
possess one or more intervening sequences in the coding region for their DNA polymerases. These introns 
can be identified in two ways. If the intron(s) is related to the intron(s) located in T. litoraiis and/or Pyrococcus 

55 sp. DNA polymerases genes, they can be identified by low stringency hybridization to DNA probes derived from 
the intron sequences of T. litoraiis or Pyrococcus sp. DNA polymerase genes. Secondly, once the archaebac- 
terium DNA polymerase gene has been identified and isolated as described above, its DNA polymerase gene 
can be sequenced at the DNA level and the sequence compared to (1) other DNA polymerases to identify non- 
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similar segments, or (2) conserved motifs to look for the absence of one or more of Regions I- VI, followed by 
identification of interruption points in the Region(s) which are absent 

Once identified, the intron(s) can be removed in vitro by, for example, the techniques described above and 
in the Examples for removal of the two tntrons in the T. Htoraiis DNA polymerase gene, 
5 The following examples are given to illustrate embodiments of the present invention as it is presently pre- 

ferred to practice. It will be understood that the examples are illustrative, and that the invention is not to be 
considered as restricted except as indicated in the appended claims. 

EXAMPLE I 

10 

PURIFICATION OF A THERMOSTABLE DNA POLYMERASE FROM THERMOCOCCUS LITORALIS 

7 litoralis strain NS-C (DSM No. 5473) was grown in the media described by Belkin, et al. supra, containing 
10 g/l of elemental sulfur in a 100 liter fermentor at its maximal sustainable temperature of approximately 80°C 

15 for two days. The cells were cooled to room temperature, separated from unused sulfur by decanting and col- 
lected by centrifugation and stored at -70°C. The yield of cells was 0.8 g per liter. 

183 g of cells obtained as described above, were suspended in 550 ml buffer A (10 mM KP0 4 buffer, pH 
7,4; 1.0 mM EDTA, 1.0 mM beta-mercaptoethanol) containing 0.1 M NaCI and sonicated for 5 minutes at4°C. 
The lysate was centrifuged at 15,000 g for 30 minutes at 4°C. The supernatant solution was passed through 

20 a 470 ml Affigel blue column (Biorad). The column was then washed with 1 000 ml of buffer A containing 0.1 M 
NaCI. The column was eluted with a 2000 ml linear gradient from 0.1 to 2.0 M NaCI in buffer A. The DNA poly- 
merase eluted as a single peak at approximately 1 .3 M NaCI and represented 80% of the activity applied. The 
peak activity of DNA polymerase (435 ml) was dialyzed against 4 liters of buffer A, and then applied to 80 ml 
Phosphocellulose column, equilibrated with buffer A containing 0.1 M NaCI. The column was washed with 160 

25 ml of buffer A containing 0.1 M NaCI, and the enzyme activity was eluted with 1 000 ml linear gradient of 0.1 to 
1 .0 M NaCI in buffer A. The activity eluted as a single peak at 0.6 M NaCI and represented 74% of the activity 
applied. The pooled activity (150 ml) was dialyzed against 900 ml of buffer A and applied to a 42 ml DNA-cel- 
lulose column. The column was washed with 84 ml of buffer A containing 0.1 M NaCI, and the enzyme activity 
eluted with a linear gradient of buffer A from 0. 1 to 1 .0 M NaCI. The DNA polymerase activity eluted as a single 

30 peak at 0.3 M NaCI, and represented 80% of the activity applied. The activity was pooled (93 ml). The pooled 
fractions were dialyzed against 2 liters of buffer A containing 0.05 M NaCI and then applied to a 1.0 ml HPLC 
mono-Q column (Pharmacia). The DNA polymerase activity was eluted with a 100 ml linear gradient of 0.05 M 
to 1.0 M NaCI in buffer A. The DNA polymerase activity eluted as a single peak at 0.1 M NaCI and represented 
16% of the activity applied. The pooled fractions (3.0 ml) were diluted to 6 ml with buffer A and applied to an 

35 1.0 ml HPLC mono-S column (Pharmacia) and eluted with a 100 ml linear gradient in buffer Afrom 0,05 to 1.0 
M NaCI. The activity eluted as a single peak at 0.19 M NaCI and represented 75% of the activity applied. 

By SDS-polyacrylamide gel electrophoresis (SDS-PAGE) and subsequent staining of the proteins using a 
colloidal stain (ISS Problue) more sensitive than Coomassie Blue (Neuhoff, et al., Electrophoresis (1988) 9:255- 
262), it was determined that the DNA polymerase preparation was approximately 50% pure: two major bands 

40 were present, one at 90,000 to 95,000 daltons and a doublet at 18,000 daltons. Figure No. 1A. Avery minor 
band was evident at approximately 80,000 to 85,000 daltons. At this level of purification the polymerase had a 
specific activity of between 30,000 and 50,000 units of polymerase activity per mg of polymerase protein. On 
a separate SDS-polyacrylamide gel verification of the identity of the stained band at 90,000 to 95,000 daltons 
was obtained by cutting the gel lane containing the purified 7: Htoraiis polymerase into 18 slices. Embedded 

45 ' proteins were eluted from the gel by crushing the gel slices in a buffer containing 0. 1 % SDS and 1 00ng/ml BSA. 
The eluted proteins were denatured by exposure to guanidine HCI, then renatured via dilution of the denaturant 
as described by Hager and Burgess Analytical Biochemistry (1980) 109:76-86. Polymerase activity as meas- 
ured by incorporation of radioactivity labeled 32 P-dCTP into acid-insoluble DNA (as previously described) and 
assayed for exonuclease activity (as measured by the release of 3 H-labelled DNA to an acid soluble form as 

50 described in Example V). As shown in Figure No. 1B, only the 90,000 to 95,000 daltons band alone showed 
either significant polymerase activity or exonuclease activity. 

The DNA polymerase preparation was dialyzed against buffer Acontaining 0.05 M NaCI. As was determined 
by SDS-PAGE, much of the 1 8,000 dalton protein precipitated out of the solution. The yield of 7: litoralis DNA 
polymerase was determined to be 0.5 mg by quantitative protein analysis, and this represented 6.5% of the 

55 total activity present in the starting crude extract. 

Purified T. Htoraiis polymerase was electrophoresed and stained with either Coomassie Blue or the colloidal 
stain (ISS Problue) previously described to detect protein. One deeply staining protein band was seen at about 
90,000 to 95,000 daltons; this molecular weight determination was obtained by comparison on the same gel 
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to the migration of the following marker proteins (Bethesda Research Laboratories): myosin, 200,000 daltons; 
phosphorylase B, 97,400 daltons; BSA, 68,000 daltons; ovalbumin, 43,000 daltons, carbonic anhydrase 29,000 
daltons; b-lactogiobulin, 18,400 daltons; lysoyzme 14,300 daltons. 

5 EXAMPLE II 

CLONING OF 7. UTORAUS DNA POLYMERASE GENE 

A, PRODUCTION OF MOUSE ANTI-7 UTORALiS DNA POLYMERASE ANT1SERA 

10 

Immunization of Mice 

A 3 ml solution containing 0.4 mg of polymerase protein (obtained by the method of Example I) was con- 
centrated at 4°C to approximately 0.3 ml and used to inoculate two mice. The purified T. litoralis polymerase 

15 preparation consisted of four bands of approximately 85-95, 75-85, and a doublet of 10-25 kDal on Coomassie 
blue stained SDS-PAGE gels. As shown in Example I, the T. litoralis polymerase is approximately 90-95 kDal. 
Both 7: litoralis polymerase antisera recognize all four proteins present in the immunogen. 

The immunization schedule was as follows: mouse one was immunized intraperitioneally (IP) with 20 ug 
of 7: litoralis polymerase, prepared as above, in Freunds* complete adjuvant (FCA). Seven days later, both mice 

20 were immunized IP with 50 u.g T. litoralis polymerase in FCA. Twenty-seven days later both mice were immun- 
ized IP with 30 ug T. litoralis polymerase for mouse one and 50 u.g 7: litoralis polymerase for mouse two in 
Freunds' incomplete adjuvant. Mouse one was bled two weeks later and mouse two was bled 20 days later. 
Sera was prepared from blood by standard methods (Harlow and Lane, Antibodies: A Laboratory Manual, 
1988). 

25 Anti-7. litoralis polymerase antisera was diluted in TBSTT (20 mM Tris pH 7.5, 1 50 mM NaCI, 0.2% Tween 

20, and 0.05% Triton-X 100) containing 1% BSA, 0.1% NaAzide, 0.1% PMSF. 

Preabsorption of Anti-7^ litoralis Polymerase Antiserum Against E. coli Lysates 

30 Since most sera react with E. coli proteins, T. litoralis polymerase antisera were preabsorbed, using the 

following method, against E. coli proteins to reduce background reactivity when screening libraries or recom- 
binant antigens. E. coli cell paste was thawed and lysed by sonication and soluble protein was bound to Affigel 
10 (Biorad) as described by the manufacturer. 4 ml of E. coli resin were washed two times in TBS (TBSTT with- 
out detergents). 0.35 ml of sera was diluted approximately 1 to 5 in TBSTT, 1 % BSA, 0.1% NaAzide and mixed 

35 with resin overnight at 4°C. The resin was pelleted by centrifugation and washed. The recovered preabsorbed 
sera was at a 1 to 17 dilution and was stored frozen at -20°C until use. 

For screening, preabsorbed sera was diluted as above to a final concentration of 1:200. 

B. IDENTIFICATION OF A PROBE FOR THE 7. UTORAUS POLYMERASE GENE 

Construction of a Lambda gtll Expression Library 

A probe for the T. litoralis polymerase gene was obtained following immunological screening of a lambda 
gtll expression library, 

45 ' 7! litoralis DNA was partially digested as follows: four ug of T. litoralis DNA was digested at 37°C with five 
units of Eco Rl in a 40 uJ reaction using Eco Rl buffer (Eco Rl buffer = 50 mM NaCI, 100 mM Tris pH 7.5, 20 
mM MgCi 2 , 10 mM BME). Three uJ of 100 mM EDTA was added to 15 uJ samples at 30, 45 and 60 minutes. 2 
ug of T. litoralis DNA was digested for 90 minutes at 37°C with 20 units of Eco Rl in 20 uJ reaction using Eco 
Rl buffer and the reaction was stopped by adding 2 pi of 100 mM EDTA. 0.2 \ig of each digest was electro- 

50 phoresed on an agarose gel to monitor the extent of digestion. Approximately 3 ug of T. litoralis DNA Eco Rl 
partials (14 uJ from the 60-minute digest and 1 9 uJ from the 90-minute digest) were pooled to form the "Eco Rl 
pool" and heated at 65°C for 15 minutes. 

0.5 |j.J of the Eco Rl pool were ligated to 0.28 ug of Eco Rl cut, bacterial alkaline phosphatase treated lambda 
gtil DNA in a five uJ reaction using standard ligation buffer (ligation buffer = 66 mM Tris pH 7.5, 1 mM ATP, 1 

55 mM spermidine, 1 0 mM MgCI2, 1 5 mM DTT, and 2 mg/ml gelatin) and 0.5 u.1 T4 DNA ligase (New England Bio- 
labs No. 202). The ligation was performed at 16°C overnight. 4 uJ of this ligation reaction were packaged using 
Gigapack Gold (Stratagene) according to the manufacturers instructions. After incubation at room temperature 
for two hours, the packaged phage were diluted in 500 ul of SM (SM = 100 mM NaCI, 8 mM MgSQ 4 , 50 mM 
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Tris pH 7.5, 0.01% gelatin) plus three drops chloroform. The packaged Eco Rl library was called sample. V6-1 
and consisted of 1.1 x 10 s individual phage. E. coll strain ER1578 was used for phage infection. 

Immunological Screening of Lambda gtll Expression Library 

5 

The initial phage library was screened (Young, R.A. and R.W. Davis Science, (1983) 222:778-782) with a 
1 :200 dilution of the antiserum produced above. 36 phage (V10-22 through V10-55) which reacted with the anti- 
T. litoralis DNA polymerase antiserum were picked and 16 phage were plaque purified. 

The 1 6 antibody positive phage were used to lysogenize £. co//K-1 2 strain Y 1 089. Lysogens were screened 
10 for thermostable DNA polymerase activity, no activity was detected. 

Western blots (Towbin, et al., PNAS, (1979) 76:4350-4354) from these 16 lysates were probed with anti- 
T. litoralis polymerase antiserum. All proteins from these lysates which reacted with T. litoralis polymerase an- 
tiserum were smaller than T. litoralis polymerase, and were also smaller than beta-galactosidase, indicating 
that none were fusion proteins with beta-galactosidase. 
15 Eight of the 16 antibody positive phage were used to affinity purify epitope-specific antibodies from total 

antiserum (Beall and Mitchell, J. Immunological Methods , (1986) 86:217-223). 

The eight affinity purified sera were used to probe Western blots of both purified T. litoralis polymerase and 
T. litoralis crude lysates. Antibody purified from NEB 618 plaques specifically reacted with T. litoralis polymerase 
in purified and T. litoralis crude lysates. This was strong evidence that phage NEB 618 encodes approximately 
20 38 kDal of the amino terminus of the T. litoralis polymerase. 

Characterization of Phage NEB 618 and Subcloning of Eco Rl Inserts 

Western blot analysis indicated that phage NEB 618 synthesized several peptides ranging in size from approx- 

25 imately 15-40 kDal which bound T. litoralis polymerase antisera. DNA from phage NEB 618 was purified from 
liquid culture by standard procedures (Maniatis, et al., supra.) Digestion of NEB 618 DNA with Eco Rl yielded 
fragments of 1.3 and 1.7 kb. An Eco Rl digest of NEB 618 DNA was ligated to Eco Rl cut pBluescript DNA. 20 
ug of pBluescriptSK+ were digested with 40 units of Eco Rl in 40ul Eco Rl buffer at 37°C for three hours, followed 
by 65° for 15 minutes. 10 ug of NEB 618 DNA were digested with 40 units of Eco Rl in 40 ul Eco Rl buffer at 

30 37°C for 75 minutes, followed by 65°C for 15 minutes. 1.75 ug of Eco Rl cut NEB 618 DNA were ligated to 20 
ng Eco Rl cut pBluescriptSK+ with one ul T4 DNA ligase (New England Biolabs No. 202) in 1 0 ul ligation buffer. 
The ligation was performed overnight at 16°C. JM101 CaCI competent cells (Maniatis, et al., supra) were trans- 
formed with 5 ul of the ligation mixture. Of 24 recombinants examined, all but one contained the 1.7 kb fragment; 
clone V27-5.4 contained the 1.3 kb T. litoralis DNA fragment. 

35 Antibodies from T. litoralis polymerase mouse antisera were affinity purified, as described above, on lysates 

from V27-5.4 (encoding the 1 .3 kb Eco Rl fragment) and V27-5.7 (encoding the 1 .7 kb Eco Rl fragment in pBlue- 
script) and reacted with Western blot strips containing either purified or crude 7". litoralis polymerase. Antibodies 
selected on lysates of V27-5.4 reacted with T. litoralis polymerase in both crude and purified preparations. In 
addition, the first three amino acids from the N-terminal protein sequence of native T, litoralis polymerase (me- 

40 thionine-isoleucine-leucine) are the same as in the predicted open reading frame (ORF) in the V27-5.4 clone. 
From these results it was concluded that V27-5.4 encoded the amino terminal of T. litoralis polymerase. 
The 1.3 kb Eco Rl fragment of V27-5.4 comprises nucleotides 1 to 1274 of Figure No. 6. The insert DNA was 
large enough to encode the biggest peptides synthesized by this clone, but not the entire T. litoralis polymerase. 

45 ' C. CONSTRUCTION AND SCREENING OF T. LITORALIS SECONDARY LIBRARIES 

Antibody screening discussed above, had identified the DNA fragment coding the amino terminal half of 
the T. litoralis polymerase. In order to find a fragment large enough to code for the entire gene, restriction digests 
of 7: litoralis DNA were probed with the amino terminal half of the polymerase gene contained in clone V27- 

50 5.4. Restriction digests were performed in separate tubes using a master mix which contained 1.2 ug of 7: li- 
toralis DNA in 39 ul of restriction enzyme buffer (REB, restriction enzyme buffer = 50 mM NaCI, 10 mM Tris pH 
7.5, 20 mM MgCI2, 10 mM BME), to which 1.5-200 U of enzyme were added as followed: 1.5 U Avrll, 9 U Eael, 
10 U Nhel, 20 U Notl, 9 U Spel, 20 U Xhol, 30 U Xbal, 20 U Sad, 10 U BamHI, 20 U Clal, 20 U Hindlll, 20 U 
Pstl, 12 U Nael, 10 U Seal, 12 U Xmnl, 20 U EcoRV, 20 U Sal, 20 U Eco Rl, 200 U Eagl, 20 U Dral, 5 U Hapl, 

55 8 U Nrul, 4 U SnaBI, 8 U Stul, 10 U Bell, 8 U Bglll, 10 U Rsal, 10 U Haelll, 8 U AIul, 4 U Hindi, 10 U Pvull, 6 
U Sspl. One ul 10 mg/ml BSA was added to the Hindi digest. Ball digest was prepared as above except there 
was 0 mM NaCI in the buffer. All digest were overnight at 37°C except Bell which was incubated at 50°C. Digests 
were electrophoresed on agarose gels and transferred to NC (Southern, J. Mot. Biol. (1975) 98:503-517). The 
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filters were probed with radiolabeled V27-5.4 DNAand hybridization was detected by autoradiography. In most 
digests, V27-5.4 DNA hybridized to fragments greater than 20 kb t except BamHI (approximately 14 kb), Eco 
Rl (1.3 kb), Hindlll (approximately 2.4, 5.4 kb), Xbal (approximately 8 kb), Clal (approximately 4.4, 5.5 kb), Ball 
(approximately 8.5 kb), Hindi (approximately 2.1, approximately 2.4 kb), Nrul (approximately 5.5 kb), Bglll (ap- 
5 proximately 2.9 kb), Haelll (approximately 1.3, approximately 1.4 kb) and Rsal which gave numerous small 
bands. 

Digests yielding single fragments large enough to encode the entire polymerase gene, estimated to be 2.4- 
3 kb, based on the size of the native protein, were BamHI, Xbal, and Nrul. 

10 BamHI Library 

A BamHI genomic library was constructed using lambda Dashll. Lambda Dashll is a BamHI substitution 
vector that can be used to clone 1 0-20 kb BamHI DNA fragments. 25-75 nanograms of T. litorafis genomic DNA 
digested with BamHI, as described above, was ligated to 0.5 ug BamHI digested, calf intestine phosphatase 

15 treated lambda Dashll DNA in five ul of standard ligation buffer including 0.5 ul T4 DNAIigase (New England 
Biolabs No. 202). Three uJ of the ligation reaction was packaged (Gigapack Plus, Stratagene) as described 
above. Plaque lifts of 8,000 plaques from the lambda Dashll library were probed with labeled ge! purified 1.3 
kb Eco Rl fragment from clone V27-5.4 (Maniatis, et al., supra). 2.5% of the phage hybridized to the 1.3 kb 
Eco Rl DNA fragment, two of which were plaque purified (clones lambda NEB 619 and lambda V56-9). Both 

20 phage contained a 12-15 kb BamHI fragment which hybridized to the 1.3 kb Eco Rl fragment and contained 
the approximately 8 kb Xbal and approximately 5.5 kb Nrul fragments. The BamHI insert was subcloned into 
pBR322. Colonies containing this fragment grew very poorly and, based on the polymerase assay described 
above, failed to produce detectable levels of thermostable DNA polymerase. 

25 Xbal Library 

T. litoralis DNA digested with Xbal was cloned into the Xbal site of pUC19. Colony lifts were probed with 
radiolabeled V27-5.4 DNA. No positive clones were detected. 

The Xbal fragment from the BamHI insert in lambda NEB 619 (BamHI library above) was subcloned into 

30 the Xbal site of pUC19. Approximately 0.3 ug of NEB 619 DNA digested with BamHI was ligated to 0.1 ug pUC19 
DNA digested with BamHI using two ul T4 DNA ligase (New England Biolabs No. 202) in 20 ul of standard lig- 
ation buffer. The ligation was incubated overnight at 16°C. CaCI 2 competent JM101 and XL-1 cells were trans- 
formed with five uJ of ligation mix and incubated overnight at 37°C (Maniatis, et a!., supra). Colony lifts were 
probed with radiolabeled purified 1 .3 kb Eco Rl fragment from V27-5.4 DNA. No positives were detected. Com- 

35 petent RRI cells were transformed with 10 ul of ligation mix and incubated overnight at 30°C. Micro-colonies 
were picked and mini-plasmid preparations (boiling method, Maniatis, et al., supra) analyzed. Most of these 
clones contained the approximately 8 kb Xbal fragment. The rationale for this latter experiment was that since 
the BamHI clones grew poorly, there would be an increased chance of isolating a plasmid containing the 7: 
litoralis polymerase gene from an Xbal colony that also grew slowly. Also, lower temperature of incubation re- 

40 suits in less copies of pUC1 9 plasmids per cell. These results provided evidence that the T. litoralis polymerase 
gene was toxic to E coli. Using the polymerase activity assay described above, no thermostable polymerase 
activity was detected in these clones. Restriction analysis indicated that the Xbal clones should contain the 
entire polymerase gene. See Figure No. 2. 

45 ' Nrul Libraries 

Approximately 0.3 \ig of NEB 619 DNA (BamHI library above) cut with Nrul was ligated to 0.1 ug of pUC19 
DNA cut with Hindi exactly as described for the Xbal library. Again, no positives were found by hybridization 
when cells were incubated at 37°C, but when transformants were incubated at 30°C t many micro-colonies were 

so observed. The majority of these micro-colonies contained the approximately 5.5 kb Nrul insert. Using the poly- 
merase activity assay described above, no thermostable polymerase activity was detected in these colonies. 
Analysis of these colonies determined that when the direction of 7". Litoralis polymerase transcription was the 
same as lacZ in pUC19, the colonies failed to grow at 37°C and were extremely unstable. 
However, colonies in which the direction of 7^ litoralis polymerase transcription was opposite of lacZ in pUC1 9, 

55 such as in clone Nru21, were more stable. This indicated that transcription of T. litoralis polymerase is detri- 
mental to E. co//, and may explain why it was so difficult to done the entire gene. Restriction mapping analysis 
indicated that the Nrul dones should contain the entire polymerase gene. See Figure No. 2. 
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Conclusions Concerning Direct Cloning of the Polymerase 

The T. litoralis is approximately 90-95 kDal which would require approximately 2.4-3.0 kb DNA to encode 
the entire gene. Restriction mapping analysis of the 1 .3 kb Eco Rl fragment, coding for the amino-terminus of 
5 the T. litoralis polymerase gene, found within the BamHI, Xbal and Nrul clones, discussed above, indicates that 
all three clones contain the entire polymerase gene. All of these larger clones were unstable in E. coil Therefore, 
alternate methods, as discussed below, for cloning the polymerase were tested. 

D. CLONING THE SECOND HALF OF T. LITORALIS POLYMERASE GENE 

w 

It is believed that when the entire T. litoralis polymerase gene was cloned in E. coll while under its endo- 
genous control, mutations in the gene arose. To prevent selection of inactive mutants, the polymerase gene 
was cloned from the T. litoralis genome in 2 or more pieces which should each separably be inactive and there- 
fore not selected against. Restriction mapping of the T. litoralis genome was used to determine which restriction 

15 enzymes would produce fragments that would be appropriate for cloning the second half of the T. litoralis poly- 
merase gene. Although the above data indicates that expression of T. litoralis polymerase was toxic for E. coll 
it was also possible that DNA sequences themselves, in or outside of the coding region, were toxic. Therefore, 
the minimum sized fragment which could encode the entire gene was determined to be the best choice. Re- 
striction analysis indicated that there was an approximately 1.6 kb Eco Rl fragment adjacent to the 3' end of 

20 the amino terminal 1.3 kb Eco Rl fragment (see Figure No. 2) which could possibly complete the polymerase 
gene. 

Hybridization Probe for the Second Half of the T. litoralis DNA Polymerase Gene 

25 Since none of the previous clones expressed thermostable polymerase activity, it was possible that they 

had accumulated mutations in the coding sequence and would therefore not be suitable sources of the second 
half of the gene. Hybridization probes were therefore required in order to clone the downstream fragments from 
the genome. The approximately 3.2 kb Ndel/Clal fragment from clone Nru21 (the Nru21 clone contains an ap- 
proximately 5.5 kb insert, beginning approximately 300 bp upstream from the start of the polymerase gene) 

30 was subcloned into pSP73 (Promega) creating clone NCII. CaCI 2 competent RRI cells were transformed, as 
above, with the ligation mixture. Mini-plasmid preps of transformants were analyzed by digestion with Ndel and 
Clal and clone NCII containing the 7*. litoralis 3.2 kb Ndel/Clal fragment was identified. This clone was stable 
in E. coll The NCII insert was sequenced (Sanger, et al., PNAS, (1 977) 74:5463-5467). The Clal end was iden- 
tical to the V27-5.4 sequence (1.3 kb Eco Rl fragment coding for the amino-terminus of the 7^ litoralis polymer- 

35 ase). The 1 .3 kb Eco Rl junction and beyond was sequenced using primers derived from the 1 .3 kb Eco Rl 
fragment sequence. The Ndel end was sequenced from primers within the vector. 

Screening of Eco Rl Genomic Libraries 

40 10 u.g of NC11 were digested with 30 U of Eco Rl in 100 \i\ of Eco Rl buffer at 37°C for two hours. The 

approximately 1 .6 kb Eco Rl fragment was purified on DE-81 paper (Whatman) after electrophoresis. The ap- 
proximately 1 .6 kb Eco Rl fragment was radiolabeled and used to probe the original Eco Rl lambda gtll library. 
Infection and plaque lifts were performed as above. Three positives were identified and plaque purified. All con- 
tain the approximately 1.6 kb Eco Rl fragment, but some also contain other inserts. 

45 - An Eco Rl library was also constructed in lambda Zapll. 2 u.g of T. litoralis DNA were digested with 20 U 
Eco Rl for five hours at 37°C in 20uJ Eco Rl buffer and then heat treated at 65°C for 1 5 minutes. Approximately 
15 nanograms of T. litoratis DNA/Eco Rl was ligated to 0.5 ng of Eco Rl cut, phosphatased lambda Zapll DNA 
(Stratagene) with 0.5 \x\ T4DNA ligase (New England Biolabs No. 202) in 5 \x\ of ligation buffer at 1 6°C overnight. 
4 u.l of ligated DNA was packaged (GigaPack Gold, Stratagene). Infection and plaque lifts were performed as 

so above. Approximately 1,500 phage were probed with radiolabeled approximately 1.6 kb Eco Rl fragment as 
above. Five hybridization positive plaques were picked and three were plaque purified. Two phage (NEB 620 
and V109-2) were rescued as pBluescript recombinants (V117-1 and V117-2) by in-vivo excision according to 
the manufacturer's instructions (Stratagene). Both contained the approximately 1.6 kb Eco Rl fragment plus 
different second fragments. The 5' end was sequenced and corresponds to the sequence determined from 

55 NC11 (Clal/Ndel fragment). See Figure No. 2. This Eco Rl fragment contains 3/6 of the T4 DNA polymerase 
family homology islands as described by Wang, et aL, supra. The 1 .6 kb Eco Rl fragment comprises nucleotides 
1269 to 2856 of Figure No. 6. 

The sequence of the 1 .6 kb Eco Rl and Clal/Ndel fragments indicated that the 1 .9 kb Eco Rl fragment may 
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be necessary to complete the polymerase gene. Lambda Zapll phage, V110-1 through V110-7, containing the 
1 .9 kb Eco Rl fragment were identified as described above for NEB 620 using labeled probes. Two phage (V1 1 0- 
2 and V110-4) were rescued as pBluescript recombinants (V153-2 and V153-4) by in-vivo excision according 
to the manufacturers instructions (Stratagene). Both contained the approximately 1.9 kb Eco Rl fragment plus 

5 different second fragments. The 1.9 kb Eco Rl fragment had sequence identity with the overlapping region in 
NC11. The 1.9 kb Eco Rl fragment comprises nucleotides 2851 to 4771 of Figure No. 6. 

The entire T. Htoralis polymerase gene has been cloned as BamHI, Xbal and Nrul fragments which were 
unstable and from which the active enzyme was not detected. The gene has also been cloned in four pieces 
(1.3 kb Eco Rl fragment, approximately 1.6 kb Eco Rl fragment, approximately 1.9 kb Eco Rl fragment and an 

10 Eco RI/BamHI fragment containing the stop codon). The 1.3 kb Eco Rl fragment stably expresses the amino 
terminal portion of the polymerase. 

EXAMPLE III 

15 CLONING OF ACTIVE T. UTORALIS DNA POLYMERASE 

The T. Htoralis polymerase gene found on the 14 kb BamHI restriction fragment of bacteriophage NEB619 
(ATCC No. 40795), was sequenced using the method of Sanger, et al., PNAS (1977) 74:5463-5467. 5837 bp 
of continuous DNA sequence (SEQ ID NO:1) was determined beginning from the 5' end of the 1.3 kb EcoRI 

20 fragment (position NT 1), see Figure No. 6. 

From analysis of the DNA sequence, it was determined that the polymerase gene begins at NT 291 in the 
1 .3 kb EcoRI fragment. A translation termination site beginning at NT 5397 was also located. Since the apparent 
molecular weight of T. Htoralis polymerase was approximately 90-95 Kdal, it was predicted that the gene should 
be -2900 bp. Instead, a 5 1 06 bp open reading frame (ORF) was identified with a coding capacity of 1 702 amino 

25 acids (aa) or -185 Kdal. 

By sequence homology with other DNA polymerases, an example of which is set out in Figure No. 7, it was 
discovered that the T. Htoralis polymerase gene was interrupted by an intron or intervening sequence in DNA 
polymerase consensus homology region III (hereinafter "I VS1") (Wang, T, et al., FASEB Journal (1 984) 3:14- 
21 the disclosure of which is herein incorporated by reference). The conserved amino acids of the consensus 

30 DNA polymerase homology region III are shown in Figure No. 7. In the Figure, the conserved amino acids are 
underlined. As can be seen in Figure No. 7, the left side of the T. Htoralis homology island III (SEQ ID NO:2) 
begins at NT 1737, and homology to the consensus sequence is lost after the Asn and Ser residues. The right 
side of the T. Htoralis homology island III (SEQ ID NO:3) can be picked up at NT 3384, at the Asn and Ser re- 
sidues. When the two T. Htoralis polymerase amino acid sequences were positioned so that the Asn and Ser 

35 residues overlap, as in Figure No. 7, it was evident that a good match to the DNA polymerase homology region 
III existed. 

Using the homology data, it was therefore predicted that an intervening sequence existed in the T. Htoralis 
DNA separating the left and right halves of the DNA polymerase homology region III. 

In one preferred embodiment, the intervening sequence was deleted by identifying unique restriction en- 
40 zyme sites in the coding region which were near the intervening sequence splice junction. A synthetic duplex 
oligonucleotide was synthesized, and used to bridge the gap between the two restriction fragments. A multi- 
part sequential ligation of the carboxy end restriction fragments, the bridging oligonucleotide, the amino end 
restriction fragment, and the expression vector, resulted in the formation of an expression vector containing an 
intact polymerase gene with the intervening sequence deleted. 
45 ' Specifically, the DNA fragments or sequences used to construct the expression vector of the present in- 
vention containing the T. Htoralis DNA polymerase gene with the intervening sequence deleted were as follows: 
1. An Ndel site was created by oligonucleotide directed mutagenesis (Kunkel, et al., Methods in Enzomol- 
ogy (1 987) 1 54:367:382) in plasmid V27-5.4 (Example II , Part B) such that the initiation codon of the poly- 
merase coding region is contained within the Ndel site. 

50 

Original sequence . . . TIT ATG . . . 

(nucleotides 283-293) 

5 5 New sequence . . . CAT ATG ... 

Sequences from the newly created Ndel site to the Clal site (approximately 528 base pairs) were util- 
ized in the construction of the expression vector. 
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2. An approximately 899 bp sequence between the Clal and Pvul site of NC11 (Example II, Part D).. 

3. Asynthetic duplex which spans the intervening sequence, connecting Pvul and Bsu36l sites derived from 
other fragments, as set out in Figure No. 12. 

In Figure No. 12, the first line indicates the original sequence at the 5' end of the splice junction (nu- 
5 cleotides 1721-1784, SEQ ID NO:1), the second line indicates the original sequence of the 3' end of the 

splice junction (nucleotides 3375-3415, SEQ ID NO:1), and the third and fourth lines indicate the sequence 
of the synthetic duplex oligonucleotide. 

4. A Bsu361 to BamHI fragment, approximately 2500 base pairs, derived from bacteriophage NEB 619 (Ex- 
ample tl, Part C). 

10 5. A BamHI to Ndel fragment of approximately 6200 base pairs representing the vector backbone, derived 

from pET11c (Studier, Methods in Enzomology, (1990) 185:66-89), and which includes; 

a) The T7 phi 10 promoter and ribosome binding site for the gene 10 protein 

b) Ampicillin resistance gene 

c) lacli gene 

15 d) Plasmid origin of replication 

e) Afour-fold repeat of the ribosomal transcription terminators (rrnb), Simons, eta!., Gene (1987) 53:85- 
96. 

The above DNA fragments, 1-5, were sequentially ligated under appropriate conditions using T4 DNAIig- 
ase. The correct construct was identified by restriction analysis and named pPR969. See Figure No. 8. pPR969 
20 was used to transform E. coli strain RRI, creating a strain designated NEB 687. A sample of NEB 687 was de- 
posited with the American Type Culture Collection on December 7, 1990 and bears ATCC No. 68487. 

In another preferred embodiment, the T. litoralis polymerase gene, with the intervening sequence deleted, 
was cloned into a derivative of the Studier T7 RNA polymerase expression vector pET1 1c (Studier, (1990) su- 
pra). The recombinant plasmid V174-1B1 was used to transform E coli strain BL21 (DE3)pLysS, creating strain 
25 175-1 B1, designated NEB671. See Figure Nos. 5 and 10. 

A sample of NEB671 was deposited with the American Type Culture Collection on October 17, 1990 and 
bears ATCC No. 68447. 

A comparison between the predicted and observed molecular weights of the polymerase, even with the 
IVS1 deleted, revealed a discrepancy. The predicted molecular weight of the polymerase after removal of IVS1 

30 in region III is 1 32 Kb, while the observed molecular weight of either the native (see Example I) or recombinant 
(see Example IV) polymerase is about 95 kD. The molecular weight discrepancy is due to an intron (hereinafter 
"IN^") in homology region I. This finding is based on the following observations: The distance between hom- 
ology regions III and I varies from 15-135 amino acids in members of the pol alpha family (Wang, (1989) supra). 
In T. iitoralis there are 407 amino acids or -44-kD separating these regions. T. litoralis DNA polymerase is very 

35 similar to human pol alpha except for 360 amino acids between conserved homology regions I and III where 
no similarity exists. Finally, no consensus region I is observed. 

In addition, as determined by SDS-PAGE, a thermostable endonuclease of approximately 42-47 kD is also 
produced by the 7*. litoralis DNA polymerase clones of the present invention (see Example X). This endonu- 
clease was purified to homogeneity by standard ion exchange chromatography, and was sequenced at its ami- 

40 no-terminal. The first 30 amino acids of the endonuclease correspond to the amino acids encoded beginning 
at nucleotide 3534 of the polymerase clone (SEQ ID NO:1). This corresponds to the portion of the polymerase 
which lacks homology with other known polymerases This endonuclease does not react with anti-I litoralis 
DNA polymerase antisera. While the exact mechanism by which the endonuclease is spliced out of the poly- 
merase is unknown, it occurs spontaneously in both E. coli and T. litoralis. 

45 ' 

EXAMPLE IV 

PURIFICATION OF RECOMBINANT T. LITORALIS DNA POLYMERASE 

50 E. coli NEB671 (ATCC No. 68447) was grown in a 100 liter fermentor in media containing 10 g/liter tryptone, 
5 g/liter yeast extract, 5 g/liter NaCI and 100 mg/liter ampicillin at 35°C and induced with 0.3 mM IPTG at mid- 
exponential growth phase and incubated an additional 4 hours. The cells were harvested by centrifugation and 
stored at -70°C. 

580 grams of cells were thawed and suspended in Buffer A (100 mM NaCI, 25 mM KP0 4 at pH 7.0, 0.1 
55 mM EDTA, 0.05% Triton X-100 and 10% glycerol) to a total volume of 2400 ml. The cells were lysed by passage 
through a Gaulin homogenizer. The crude extract was clarified by centrifugation. The clarified crude extract 
volume was adjusted to 2200 mis with the above buffer and was heated to 75°C for 30 minutes. The particulate 
materia! was removed by centrifugation and the remaining supernatant contained about 3120 mg of soluble 
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protein. 

The supernatant was applied to a DEAE-sepharose column (5x13 cm; 255 ml bed volume) linked in series 
to a phosphocellulose column (5 x 11 cm; 216 ml bed volume). The DEAE-sepharose flow-through fraction, 
containing the bulk of the enzyme, passed immediately onto the phosphocellulose column. Both columns were 
5 washed with 300 mis Buffer A, the two columns were disconnected, and the protein on the phosphocellulose 
column was eluted with a 2 liter linear gradient of NaCI from 0.1 M to 1 M formed in Buffer A. 

The column fractions were assayed for DNA polymerase activity. Briefly, 1-4 yd of fractions were incubated 
for 5-10 minutes at 75°C in 50 ul of 1X T. litoralis DNA polymerase buffer (10 mM KCI, 20 mM Tris-HCI (ph 8.8 
at24°C), 10 mM (NH 4 ) 2 S0 4 , 2 mM MgS0 4 and 0.1% Triton X-100) containing 30 u.M each dNTP and 3 H-labeled 
10 TTP, 0.2 mg/ml activated calf thymus DNA and 100 ug/ml acetylated BSA, although it has been found that non- 
acetylated BSA is preferred. The mixtures were applied to Whatman 3 mm filters and the filters were subjected 
to three washes of 10% TCA followed by two washes of cold isopropanol. After drying of the filters, bound ra- 
dioactivity representing incorporation of 3 H-TTP into the DNA was measured. The active fractions were pooled 
and the enzyme activity levels in each pool were assessed using the above assay conditions except the dNTP 
15 level was raised to 200 uM each dNTP. Under these conditions one unit of enzyme activity was defined as the 
amount of enzyme that will incorporate 1 0 nmoles of dNTP into acid-insoluble material at 75°C in 30 minutes. 

The active fractions comprising a 300 ml volume containing 66 mg protein, were applied to a hydroxy la patite 
column (2.5 x 5 cm; 25 ml bed volume) equilibrated with Buffer B (400 mM NaCI, 10 mM KP0 4 at pH 7.0, 0.1 
mM EDTA, 0.05% Triton X-1 00 and 1 0% glycerol). The protein was eluted with a 250 m! linear gradient of KP0 4 
20 from 10 mM to 500 mM formed in Buffer B. The active fractions, comprising a 59 ml volume containing 27 mg 
protein, was pooled and dialyzed against Buffer C (200 mM NaCI, 10 mM Tris-HCI at pH 7.5, 0.1 mM EDTA, 
0.05% Triton X-100 and 10% glycerol). 

The dialysate was applied to a heparin-sepharose column (1 .4 x 4 cm; 6 ml bed volume) and washed with 
20 ml Buffer C. A 100 ml linear gradient of NaCI from 200 mM to 700 mM formed in Buffer C was applied to the 
25 column. The active fractions, comprising a 40 ml volume containing 16 mg protein was pooled and dialyzed 
against Buffer C. 

The dialysate was applied to an Affi-gel Blue chromatography column (1 .4 x 4 cm; 6 ml bed volume), wash- 
ed with 20 ml Buffer C, and the protein was eluted with a 95 ml linear gradient from 0.2 M to 2 M NaCI formed 
in Buffer C. The active fractions, comprising a 30 ml volume containing 11 mg of protein, was dialyzed against 
30 a storage buffer containing 200 mM KCI, 10 mM Tris-HCI (pH 7.4), 1 mM DTT, 0.1 mM EDTA, 0.1% Triton X- 
100, 100 ^g/ml BSA and 50% glycerol. 

The T. litoralis DNA polymerase obtained above had a specific activity of 20,000-40,000 units/mg. 

Characterization of recombinant T. litoralis polymerase 

35 

Recombinant and native T. litoralis polymerase had the same apparent molecular weight when electrophor- 
esed in 5- 10% SDS-PAGE gradient gels. Recombinant T. litoralis polymerase maintains the heat stability of 
the native enzyme. Recombinant T. litoralis polymerase has the same 3'— >5' exonuclease activity as native 
T. litoralis polymerase, which is also sensitive to inhibition by dNTPs. 

40 

EXAMPLE V 

OVER-EXPRESSION OF THE THERMOCOCCUS LITORALIS DNA POLYMERASE GENE 

45 < The T. litoralis DNA polymerase gene, with IVS1 deleted, e.g., V1 74-1 B1 obtained in Example Ml, may be used 
in a number of approaches, or combinations thereof, to obtain maximum expression of the cloned T. litoralis 
DNA polymerase. 

One such approach comprises separating the T. litoralis DNA polymerase gene from its endogenous control 
elements and then operably linking the polymerase gene to a very tightly controlled promoter such as a T7 ex- 

50 pression vector (Rosenberg, et al., Gene (1987) 56:125-1 35). Insertion of the strong promoter may be accom- 
plished by identifiying convenient restriction targets near both ends of the T. litoralis DNA polymerase gene and 
compatible restriction targets on the vector near the promoter, or generating restriction targets using site di- 
rected mutagenesis (Kunkel, (1984), supra), and transferring the T. litoralis DNA polymerase gene into the vec- 
tor in such an orientation as to be under transcriptional and translational control of the strong promoter. 

55 I litoralis DNA polymerase may also be overexpressed by utilizing a strong ribosome binding site placed 

upstream of the 7". litoralis DNA polymerase gene to increase expression of the gene. See, Shine and Dalgamo, 
Proc. Natl. Acad. Sci. USA (1974) 71:1342-1346, which is hereby incorporated by reference. 

Another approach for increasing expression of the T. litoralis DNA polymerase gene comprises altering the 
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DNA sequence of the gene by site directed mutagenesis or resynthesis to contain initiation codons that are 
more efficiently utilized than E. coli. 

Finally, 7: iitoralis DNA polymerase may be more stable in eukaryote systems like yeast and Baculovirus. 

The T. Iitoralis DNA polymerase may be produced from clones carrying the T. Iitoralis DNA polymerase gene 
5 by propagation in a fermentor in a rich medium containing appropriate antibiotics. Cells are thereafter harvested 
by centrifugation and disrupted by sonication to produce a crude cell extract containing the T. Iitoralis DNA poly- 
merase activity. 

The crude extract containing the 7: Iitoralis DNA polymerase activity is purified by the method described 
in Example I, or by standard product purification techniques such as affinity-chromatography, or ion-exchange 
10 chromatography. 

EXAMPLE VI 

PRODUCTION OF A T. LITORALIS DNA POLYMERASE 3' TO 5' EXONUCLEASE MUTANT 

T. Iitoralis DNA polymerase lacking 3' to 5' exonuclease activity was constructed using site-directed mu- 
tagenesis to alter the codons for asp141 and glu143 to code for alanine. Site-directed mutagenesis has been 
used to create DNA polymerase variants which are reported to have reduced exonuclease activity, including 
phi29 (Ce//(1989) 59:219-228) DNA polymerase I (Science (1988) 240:199-201) and T7 DNA polymerases 
20 (U.S. Patent No. 4,942,130). 

Site-directed mutagenesis of the polymerase of the present invention was accomplished using a modifi- 
cation of the technique described by Kunkel, T.A., PNAS (1985) 82:488-492, the disclosure of which is herein 
incorporated by reference. The V27-5.4 plasmid (see Example 2, Part B) was used to construct the site-directed 
mutants. V27-5.4 encodes the 1 .3 kb EcoRI fragment in pBiuescript Sk+. E. coli strain C J236 (Kunkel, et at., 
25 Methods in Enymology (1987) 154:367-382), a strain that incorporates deoxyuracil in place of deoxythymidine, 
containing the V27-5.4 plasmid was superinfected with the f1 helper phage IR1 (Virology, (1982) 122:222-226) 
to produce single stranded versions of the plasmid. 

Briefly, the side-directed mutants were constructed using the following approach. First, a mutant oligonu- 
cleotide primer, 35 bases in length, was synthesized using standard procedures. The oligonucleotide was hy- 
30 bridized to the single-stranded template. After hybridization the oligonucleotide was extended using T4 DNA 
polymerase. The resulting double-stranded DNA was converted to a closed circular dsDNA by treatment with 
T4 DNA Ngase. Plasmids containing the sought after mutations were identified by virtue of the creation of a 
Pvul site overlapping the changed bases, as set out below. One such plasmid was identified and named pAJG2. 
The original and revised sequences for amino acid residues are 141, 142, and 143: 
35 . . asp ile glu 

Original : . . GAT ATT GAA 



40 Altered: . . G CG ATC G CA 

The newly created Pvul site, used to screen for the alteration, is underlined. Note that the middle codon was 
changed but that the amino acid encoded by this new codon is the same as the previous one. 

An approximately 120 bp Clal to Ncol fragmentfrom V174-1B1 (see Example III) was replaced by the cor- 
45 - responding fragment bearing the above substitutions from pAJG2, creating pCAS4 (see Figure No. 9). pCAS4 
thus differs from V 174-1 B1 by 4 base pairs, namely those described above. 

E. coli BL21 (DE3) plysS (Methods in Enzomology, (1990) 185:60-89) was transformed with pCAS4, cre- 
ating strain NEB681. Expression of the mutant T. Iitoralis polymerase was induced by addition of IPTG. 

A sample of NEB681 has been deposited with the American Type Culture Collection on November 8, 1 990, 
50 and bears ATCC No. 68473. 

Relative exonuclease activities in the native T. Iitoralis DNA polymerase and the exonuclease minus variant 
isolated from E. coli NEB681 was determined using a uniformly PH] labeled E. coli DNA substrate. Wild type 
T. Iitoralis DNA polymerase was from a highly purified lot currently sold by New England Biolabs, Inc. The ex- 
onuclease minus variant was partially purified through DEAE sepharose and phosphocellulose columns to re- 
55 move contaminants which interfered with the exonuclease assays. The indicated number of units of POLYMER- 
ASE were added to a 0.1 ml reaction containing T. Iitoralis DNA polymerase buffer [20 mM Tris-Hcl (pH8,8 at 
25°C), 10 mM KCI, 10 mM (NH 4 ) 2 S0 4 , 5 mM MgS0 4 , 0.1% Triton X-100], 0.1 mg/ml bovine serum albumin, 
and 3 ng/ml DNA substrate (specific activity 200,000 cpm/ng) and the reaction was overlaid with mineral oil to 
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prevent evaporation of the reaction. Identical reactions contained in addition 20 uM dNTP, previously shown 
to inhibit the exonuclease activity of the wild type enzyme. The complete reaction mixture was incubated at 
70°C for 60 minutes, following which 0.08 ml was removed and mixed with 0.02 ml 0.5 mg/ml sonicated herring 
sperm DNA (to aid in precipitation of intact DNA) and 0.2 ml of 10% trichloroacetic acid at 4°C. After mixing, 

5 the reaction was incubated on ice for 5 minutes, and the DNA then pelleted at 4°C for 5 minutes in an Eppendorf 
centrifuge. 0.25 ml of supernatant was mixed with scintillation fluid and counted. The results of the sample 
counting, corrected for background, are shown in Figure No. 11, 

As illustrated in Figure No. 1 1 , the exonuclease minus variant was substantially free of exonuclease activity 
in the presence or absence of dNTPs under conditions where the native polymerase clearly demonstrated ex- 

w onuclease activity. Conservatively estimating that a level of activity two-fold above background could have been 
detected, this implies that the exonuclease activity is decreased at least 60-fold in this variant 

EXAMPLE VII 

15 T. LITORALIS DNA POLYMERASE HALF-LIFE DETERMINATION 

The thermostability or hatf-life of the T. litoralis DNA polymerase purified as described above in Example 
1 was determined by the following method. Purified T. litoralis DNA polymerase (25 units) was preincubated at 
100°C in the following buffer: 70 mM tris-HCI (pH 8.8 at 25°C), 17 mM ammonium sulfate, 7 mM MgCI 2 , 10 mM 

20 beta-mercaptoethanol, 200nM each deoxynucleotide and 200 u,g/m! DNAse-treated DNA. An initial sample was 
taken at time zero and a small aliquot equivalent to 5% of the enzyme mixture was removed at 10, 20, 40, 60, 
90, 120, 150, and 180 minutes. The polymerase activity was measured by determining incorporation of deox- 
ynucleotide into DNA as described previously. 

A sample of Taq DNA polymerase obtained from New England Biolabs was subjected to the above assay. 

25 An initial sample was taken at time zero and a small aliquot equivalent to 5% of the enzyme mixture was re- 
moved at 4, 7, and 10 minutes. As shown in the Figure No. 3, the half-life of the T. litoralis DNA polymerase at 
100°C was 60 minutes, while the haif-life of the Taq polymerase at 100°C was 4.5 minutes. 

As shown in Figure No. 3A, the half-life of T. litoralis DNA polymerase at 100°C in the absence of stabilizers 
was 60 minutes, while in the presence of the stabilizers TRITON X-100 (0.15%) or BSA (100u.g/m!) the half- 

30 life was 95 minutes. This was in stark contrast to the half-life of Taq DNA polymerases at 100°C ( which in the 
presence or absence of stabilizers was 4.5 minutes. 

The thermostability or half-life of recombinant 7^ litoralis DNA polymerase purified as describe above in Ex- 
ample IV was found to have a biphasic heat inactivation curve at temperatures greater than about 90°C. These 
two phases were characterized by half-lives of about 5 minutes and 7 hours (Fig. 3B). To provide more con- 

35 sistent behavior at extreme temperatures, an additional purification step may be used to eliminate the more 
heat sensitive component of the polymerase. 

Specifically, the final enzyme preparation of Example IV was heated at 100°C for 15 minutes then cooled 
on ice for 30 minutes. Precipitated proteins were removed by centifugation at 12,000 xg for 10 minutes at 4°C. 
Approximately 20% of the initial polymerase activity was lost in this procedure. The remaining DNA polymerase 

40 showed a monophasic heat inactivation profile, with a half-life at 95°C of about 7 hours. The resulting polymer- 
ase also showed kinetic characteristics at 75°C which were similar to the native enzyme and to the recombinant 
enzyme prepared in accordance with Example IV. 

EXAMPLE VIII 

45 ' 

DETERMINATION OF 3'-5' PROOFREADING ACTIVITY 

1. Response of T. litoralis DNA Polymerase to the Absence or Presence of Deoxynucleotides 

so The levels of exonuclease activities associated with polymerases show very different responses to deox- 

ynucleotides. Nonproofreading 5'-3* exonucleases are stimulated tenfold or greater by concomitant polymeri- 
zation afforded by the presence of deoxynucleotides, while proofreading 3'-5' exonucleases are inhibited com- 
pletely by concomitant polymerization. Lehman, I.R. ARB (1967) 36:645. 

The T. litoralis DNA polymerase or polymerases with well-characterized exonuclease functions (T4 Poly- 

55 merase, KI enow fragment) were incubated with 1 ug 3 H-thumidine-labeled double-stranded DNA(10 5 CPM/ng) 
in polymerization buffer (70 mM tris (pH 8.8 at 24°C), 2 mM MgCI 2 , 0.1% Triton and 100 ug/ml bovine serum 
albumin). After an incubation period of three hours (experiment 1) or four hours (experiment 2) at either 70°C 
(thermophilic polymerases) or 37°C (mesophilic polymerases), the exonuclease-hydrolyzed bases were quan- 
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tified by measuring the acid-soluble radioacttvely-labeled bases. 

As shown in Table 1 , the Taq DNA polymerase, with its 5'- 3' exonuclease activity, shows stimulation of ex- 
onuclease activity when deoxynucleotides were present at 30 uM. However, polymerases with 3'-5' proofread- 
ing exonuclease activities, such as theT4 polymerase, Klenow fragment of E. coli polymerase I, or the T. fitoralis 
s DNA polymerase showed the reverse, an inhibitory response to the presence of deoxynucleotides. 

The similarity of responses to the presence or absence of deoxynucleotides of the T. litoralis DNA poly- 
merase and the well-characterized Klenow fragment of the E. coli DNA polymerase is further shown in Figure 
No. 4. Twenty units of 
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either polymerase was incubated with 9 ug 3 H-thymidine-Iabeled double-stranded DNA (10 s CPM/ug) in 350 
\x\ polymerization buffer as described above in the presence, or absence of, 30 uM deoxynucleotides. At each 
time point, 50 uJ was removed and the level of acid-soluble radioactively-labeled bases were measured. As 
Figure No. 4 documents, the behavior of 7: litorah's DNA polymerase and the Klenow fragment of E. coli DNA 
polymerase, which contains a well-characterized 3'-5* proofreading exonuclease activity, are very similar. 

2. Response of T. IHoralis DNA Polymerase to Increasing Deoxy nucleotide Concentrations 

Exonuclease activities of polymerases are affected by the level of deoxynucleotides present during poly- 
merization, in as much as these levels affect polymerization. As deoxynucleotide levels are increased towards 
the Km (Michaelis constant) of the enzyme, the rate of polymerization is increased. For exonuclease functions 
of polymerases sensitive to the rate of polymerization, changes in exonuclease activity are parallel with increas- 
es in deoxynucleotide concentrations. The increase in polymerization rate drastically decreases proofreading 
3'-5' exonuclease activity with a concomitant increase in polymerization-dependent 5'-3' exonuclease activity. 

The exonuclease function of the T. litoralis DNA polymerase was compared to those of well-characterized 
exonuclease functions of other polymerases as the deoxynucleotide concentration was increased from 10 uM 
to 1 00 uM. The exonuclease activity was measured as described in (1 ) with an incubation period of 30 minutes. 
As summarized in Table 2, the I litoralis DNA polymerase responded to increases in deoxynucleotide levels 
similarly to a polymerase known to possess a 3'-5* proofreading exonuclease (Klenow fragment of E. coli DNA 
Pol. I). This 
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response was in contradiction to that of a polymerase known not to possess this proofreading function, Taq 
DNA polymerase. This polymerase responded to an increase in deoxynucleotide levels with an increase in ex- 
onuclease function due to its 5'-3' exonuclease activity. 
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3. Response of T. litoralis DNA Polymerase to Alteration from a Balanced Peoxynucleotide State to an Un- 
balanced State 

Polymerization is dependent on equal levels of all four deoxynucleotides present during DNA synthesis. If 
5 the deoxynucleotide levels are not equal, polymerases have decreased polymerization rates and are more likely 
to insert incorrect bases. Such conditions greatly increase proofreading 3-5' exonuclease activities while de- 
creasing 5-3' exonuclease activities. Lehman, I.R., ARB (1967) 36:645). 

The T. litoralis DNA polymerase was incubated with both balanced deoxynucleotide levels (30 uM) and two 
levels of imbalance characterized by dCTP present at 1/10 or 1/100 the level of the other three deoxynucleo- 
10 tides. The response of the T. litoralis DNA polymerase was then compared to that of three polymerases pos- 
sessing either the 3'-5' or the 5-3' exonuclease functions. All assays were performed as described in (1) except 
the dCTP concentrations listed below. As seen in Table 3 below, the T. litoralis DNA polymerase follows the 
expected behavior for a proofreading 3-5' exonuclease-containing polymerase; an imbalance in deoxynucleo- 
tide pools increased the exonuclease activity in a similar manner as that of the proofreading polymerases of 
15 T4 DNA polymerase or Klenow fragment of E. coli DNA polymerase I. In contrast to this response, the exonu- 
clease of the Taq DNA polymerase was not affected until the imbalance was heightened to the point that poly- 
merization was inhibited. 
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4. Directionality of Exonuclease Activity 

A proofreading exonuclease has a 3'-5' directionality on DNA while non proofreading exonuclease associ- 
ated with DNA polymerases have a 5'-3' directionality. To discern the direction of the exonuclease activity of 

5 T. litoralis DNA polymerase, the 5' blocked DNA of adenovirus was utilized. Since the 5' end of this DNA is 
blocked by protein, enzymic activities that are 5*-3' in directionality cannot digest this double-stranded DNA; 
however, enzymic activities that are 3-5', such as exonuclease III or proofreading exonuclease-containing poly- 
merases, can digest adenovirus DNA. 

Twenty-five units of exonuclease III or 20 units of either T. litoralis DNA polymerase, T4 DNA polymerase 

10 (possessing a well characterized 3'- 5' exonuclease activity), or Taq DNA polymerase (lacking such an activity) 
were incubated with 5\ig adenovirus DNA for time periods up to 30 minutes duration at either 37°C (T4 poly- 
merase and exonuclease III) or 70°C (Taq polymerase and I litoralis polymerase) in the presence of 70 mM 
tris-HCI pH 8.8 at 25°C, 2 mM MgCI 2 and 100 u.g/ml BSA. At the end of each incubation time period, enzymic 
activity was stopped by phenol extraction of the adenovirus DNA, followed by Hpal digestion for one hour at 

15 37°C in 20 mM tris, pH 7.9 at 25°C, 10 mM Magnesium acetate 50 mM potassium acetate and 1 mM DTT. The 
DNA fragments were subjected to agarose gel electrophoresis and the resulting pattern of time-dependent deg- 
radation and subsequent loss of double-stranded DNA fragments were assessed. 

The 3'-5' exonuclease activities of exonuclease III, of T. litoralis DNA polymerase and T4 DNA polymerase 
caused the disappearance of the double-strand DNA fragments originating from the 5' blocked end of the ade- 

20 novirus DNA, indicating vulnerability of its 3' end. In contrast, the Taq DNA polymerase with its 5'-3' polymer- 
ization-dependent exonuclease activity, showed no disappearance of the DNA fragment. 

EXAMPLE IX 

25 PERFORMANCE OF T. litoralis DNA POLYMERASE IN THE PCR PROCESS 

The ability of the T. litoralis DNA polymerase to perform the polymerase chain reaction (PCR) was also 
examined. In 1 00 uJ volumes containing the buffer described in Example IV, varying amounts of M13mp18 DNA 
cut by Clal digestion, generating 2 fragments of 4355 bp and 2895 bp, were incubated with 200 ng of calf thymus 
30 DNA present as carrier DNA to decrease any nonspecific adsorption effects. The forward and reverse primers 
were present at 1 u-M (forward primer = 5'd(CCAGCAAGGCCGATAGTTTGAGTT)3' and the reverse primer = 
5* d(CGCCAGGGTTTTCCCAGTCACGAC)3'). These primers flank a 1 kb DNA sequence on the 4355 bp frag- 
ment described above, with the sequence representing 14% of the total M13mp18 DNA. Also present were 200 
u-M each dNTP, 100 jig/ml BSA, 10% DMSO and 2.5 units of either T. aquaticus DNA polymerase (in the pres- 
35 ence or absence of 0.5% NP40 and 0.05% Tween 20), or 7: litoralis DNA polymerase (in the presence or ab- 
sence of 0.10% Triton X-100). The initial cycle consisted of 5 min at 95°C, 5 min at 50°C (during which poly- 
merase and BSA additions were made) and 5 min at 70°C. The segments of each subsequent PCR cycle were 
the following: 1 min at 93°C, 1 min at 50°C and 5 min at 70°C. After 0, 13, 23 and 40 cycles, 20 uJ amounts of 
100 \i\ volumes were removed and subjected to agarose gel electrophoresis with ethidium bromide present to 
40 quantitate the amplification of the 1 kb DNA sequence. 

Initial experiments with this target DNA sequence present at 28 ng and 2.8 ng established the ability of the 
T. litoralis DNA polymerase to catalyze the polymerase chain reaction; yields were comparable or not more than 
twofold greater than the seen with 7: aquaticus DNA polymerase. 

However, it was at the lower levels of target DNA sequence, 2.8femtograms, that differences in polymerase 
45 ' function were most apparent Under these conditions requiring maximal polymerase stability and/or efficiency 
at elongation of DNA during each cycle,the T. litoralis DNA polymerase produced greater than fourfold more 
amplified DNA than that of 7*. aquaticus DNA polymerase within 23 cycles. 

This ability to amplify very small amounts of DNA with fewer cycles is important for many applications of 
PCR since employing large cycle numbers for amplification is associated with the generation of undesirable 
50 artifacts during the PCR process. 

EXAMPLE X 

PURIFICATION OF RECOMBINANT T. LITORALIS INTRON-ENCODED ENDONUCLEASE 

55 

E. coli NEB671 (ATCC No. 68447), grown as described in Example IV, were thawed (70 grams) and sus- 
pended in Buffer A containing 200 ng of lysozyme per ml to a final volume of 300 ml. The mixture was incubated 
at 37°C for 2 minutes and then 75°C for 30 minutes. The heated mixture was centrifuged at 22,00 x g for 30 
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minutes and the supernatant was collected for further purification of the thermostable endonuclease. Since all 
of the nucleases from E. coli were inactivated by the heat treatment, the preparation at this stage could be used 
for characterization of the intron-encoded endonuclease. To separate this enzyme from the recombinant T. Ih 
(oralis DNA polymerase also present in the 75°C supernatant solution, the solution was passed through a 

5 DEAE-sepharose column (5 cm x 5 cm, 100 ml bed volume) and washed with 200 ml of Buffer A. Essentially 
all of the DNA polymerase activity passes through the column while the endonuclease activity sticks. The en- 
donuclease activity was eluted with a one liter linear gradient of NaCI from 0.1 M to 0.8 M formed in Buffer A. 
The endonuclease activity eluted at about 0.4 M NaCI, and was assayed in a buffer containing 10 mM KCI, 20 
mM Tris-HCI (pH 8.8 at 24°C), 10 mM (NH4) 4 S0 4 , 10 mM MgS0 4( 0.1% Triton X-100 and 1 ug of pBR322 DNA 

10 per 0.05 ml of reaction mixture. The reaction mixture was incubated at 75°C and the extent of DNA cleavage 
was determined by agarose gel electrophorese. At lower temperatures little or no endonuclease activity was 
detected. The tubes containing the peak activity were pooled, dialyzed overnight against Buffer A and then ap- 
plied to phosphocellutose column (2.5 cm x 6.5 cm, 32 ml bed volume), washed with Buffer A and the endonu- 
clease activity eluted with a linear gradient of Nad from 0.1 M to 1.5 M formed in Buffer A. The enzyme eluted 

15 at about 0.8 M NaCI. Active fractions were pooled and dialyzed overnight against Buffer A and then passed 
through a HPLC Mono-S column (Pharmacia) and eluted with a linear gradient of NaCI from 0.05 M to 1 .0 M. 
The activity eluted as a single peak and was homogeneous by SDS-PAGE: a single 42-47 kd band was detected 
by Commasie blue staining and when this band was eluted from the gel and renatured it contained the only 
endonuclease activity detected on the gel. 

20 The enzyme has preferred cutting sites on various DNAs. When used in vast excess and in Vent polymer- 

ase buffer (New England Biolabs, Beverly, MA), the enzyme has cutting sites on lambda DNA and 3 sites on 
pBR322. Two of the rapid sites on pBR322 have been sequenced: 
Region including cut site at position 164: 

25 

'5' TTGGTTATGCCGGTAC TGCCGGCCTCTT 3' 
3' AACCAATACGGC CATGACGGCCGGAGAA 5' 

30 

Region including cut site at position 2411: 

5 ' TTGAGTGAGCTGATAC CGCTCGCCGCAG 3' 
35 3' AACTCACTCGAC TATGGCGAGCGGCGTC 5' 

When IVS2 was deleted from pPR969, the resultant plasmid, pAKK4 (Example XI) now contains a very 
sensitive fast site at the exon junction: 
40 Region including the cut site at IVS2 junction: 

5' GGTTCTTTATGCGGAC *AC /TGACGGCTTTATG 3' 
3* CCAAGAAATACGCC/TG*TGACTGCCGAAATAC 5' 

45 ' 

The astericks denote the boundary between the left exon and the right exon which have been brought to- 
gether by deletion of IVS2. 

Cleavage at the l-TIi I homing site occurs 100-fold more rapidly that at the "star" sites using reaction con- 
ditions of 50 mM TRIS, (pH 7.9), 10 mM MgCI 2 , 100mM NaCI and 1 mM DTT at 50°C. Under these conditions, 
50 the enzyme cut E. coli DNA 6-10 times. "Star" cleavage is enhance by NH 4 (10 mM), higher temperatures (70- 
80°C), and higher pH (8.8-1 0). 

Thus, the endonuclease from T. IHoralis resembles other intron encoded endonucleases reported in that 
there is often a four base 3' extension at the cut site and there can be degeneracy in the recognition sequence. 

The cut site in the intron minus gene is referred to as the homing site of the intron encoded endonuclease. 
55 It is believed in the art that the intron encoded endonuclease recognizes its cut site in the gene lacking the 
intron, and that the cutting of that DNA by the endonuclease leads to insertion of the intron at the homing site. 

The thermostable endonuclease of the present invention can be used in genetic manipulation techniques 
where such activity is desired. 
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EXAMPLE XI 

Construction of T. titoralis DNA Polymerase Expression Vectors with a Deleted IVS2 

Analysis of the deduced amino acid sequence of the T. Irtoraiis gene in comparison to other alpha class 
DNA polymerases and to the endonuclease in the 1170 bp intervening sequence suggested that this intron in- 
terrupted the alpha polymerase Region I. If the first 3 amino acids preceding the endonuclease (Tyr AJa Asp) 
were joined to the Thr at aa 1472, then a good consensus Region I would be established (where underlined 
residues indicate identity): 

Region I: 
Left junction: 
Right junction: 



To facilitate this construction, a Seal site was created in the PCR primers by changing the codon usage 
for Lys 1076 and Val 1077 as follows: 

Amino acids: PHE LYS VAL LEU TYR ALA ASP 

Original sequence: 
Altered sequence: 
Sea I site: 



TYR 


GLY 


AS? 


THR 


ASP 


ScR 


TYR 


ALA 


ASP 


SER 


VAL 


SER 


VAL 


HIS 


ASN 


THR 


ASP 


GLY 


I: TYR 


ALA 


ASP 


THR 


AS? 


GLY 



PHE 


LYS 


VAL 


LEU 


TTT 


AAG 


GTT 


CTT 


TIT 


AAA 


GTA 


CTT 




A 


GTA 


CT 



The expression plasmid pAKK4 was created in a three-way ligation derived from the following components: 

1) An about 7959 bp fragment of pPR969 was derived by cleavage with Hindlll and EcoRl. 9ug of pPR969 
DNA was incubated with 1X NEBuffer 2 in a total volume of 0.1 ml with 40 units of Hindlll endonuclease 
and 40 units of EcoRl endonuclease for 1 hour at 37° C. Cleavage products were separated on a 0.7% GTG 
grade agarose gel (FMC) run in Tris Borate EDTA buffer. The appropriate band, about 8 kbp, was isolated 
by electroelution using an Elutrap elution apparatus (Schleicher and Schuell) using the manufacturer's rec- 
ommended running conditions. Following elution, the fragment was concentrated by ethanol precipitation 
and the recovery quantified by comparison with known weight standards on agarose gel electrophoresis. 

2) An about 638 bp fragment with Seal and EcoRl termini derived from a PCR product. The reaction mixture 
contained 1 X NEB Vent Polymerase Buffer, 0.1 mg/ml bovine serum alumen, 0.2 mM dNTPs (equimolar, 
each nucleotide), 0.9 u,g/ml pV174-1B1 plasmid DNA template, and 0.01 A^U/ml of primer 72-150 
(5'ATAAAGTACTTTAAAG CCGAACTTTTCCTCTA3') and primer "JACK" (5'CGGCGCATATGATACTGGA- 
CACTGATTAC3'). 0.1 ml of the reaction mix was placed into each of five tubes, and the samples heated 
to 95°C for 3-5 minutes in a Perkin-Elmer Thermocycler. 1 U of Vent DNA polymerase was added to each 
reaction tube, and 15 cycles were run on the thermocycler consisting of 94° C- 0.5 minutes, 50°C - 0.5 
minutes, and 72°C - 2 minutes. The samples were pooled, phenol extracted and ethanol precipitated. The 
sample was resuspended in 50 ul Tris-EDTA buffer and mixed with 40 nl of dH 2 0, 1 0 uJ of 1 0X NEBuffer 
3, 60 units of Seal endonuclease and 60 units of EcoRl endonuclease. After incubation at 37°C for 1 .75 h, 
the reaction products were separated on a 1 .5% agarose gel and the ca. 638 bp fragment was electroeluted, 
and quantified as described above. 

3) An about 358 bp fragment with Hindlll and Seal termini derived from a PCR product. The reaction mixture 
contained 1 X NEB Vent Polymerase Buffer, 0.1 mg/ml bovine serum albumin, 0.2 mM dNTPs (equimolar, 
each nucleotide), 0.9 ug/ml pV174-1B1 plasmid DNA template, and 0.02 A 2 6o/ml of primer 698 (5'GA- 
GACTCG CGGA GAAACTTGGACT3') and primer 73-143 (5TACAGTACTTTATGCGGACACT- 
GACGGCi I I f ATGCCAC3'). 0.1 ml of the reaction mix was placed into each of five tubes, and the samples 
heated to 95°C for 3-5 minutes in a Perkin-Elmer Thermocycler. 1 U of Vent DNA polymerase was added 
to each reaction tube, and 20 cycles were run on the thermocycler consisting of 94°C - 0.5 minutes, 50°C 
- 0.5 minutes, and 72°C-1 minute. The samples were pooled, phenol extracted and ethanol precipitated. 
The sample was resuspended in 50 uJ Tris-EDTA buffer and cleaved with Hindlll and Seal endonucleases. 
The reaction products were separated on a 1 .5% agarose gel and the 358 bp fragment was electroeluted, 
and quantified as described above. 

The ligation reaction contained approximately 1 ug/ml of the pPR969 fragment described above, 0.8 ug/ml 
of the 638 bp fragment described above, 0.4 jig/ml of the 358 bp fragment described above, 1X NEB ligation 
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buffer and 100,000 units/ml T4 DNAIigase. Ligation occured at 16°C for 5 hours Correctly constructed recom- 
binants were identified by the Seal digestion pattern, and transformed into BL21(DE3) plysS to screen for in- 
ducible activity, as described above. Two such isolates, pAKK4 and pAKK15 were used in subsequent studies. 
These two isolates appear to be identical, although they were isolated from independent isolates. 

5 Expression from the new construct pAKK4 appears to yield 3-10-fold more active T. litoralis DNA polymer- 

ase than pPR969 without expression of the endonuclease from the 1170 bp intron. 

An expression vector for production of the exonuclease deficient variant of the T. litoralis polymerase was 
constructed by replacing a 141 7 bp Clal-Sphl fragment from pAKK15 with an analogous 1417 bp fragment from 
pCBA1, the original exonuclease-deficient T. litoralis DNA polymerase construct One such recombinant was 

10 named pAKM8 and was characterized further. 

EXAMPLE XII 

PURIFICATION OF A THERMOSTABLE DNA POLYMERASE FROM PYROCOCCUS SPECIES 

15 ~ 

Pyrococcus sp. strain GB-D (ATCC No. 55239) was grown in the media described by Belkin, et al., supra, 
containing 10 g/l of elemental sulfur in 8 one liter bottles at 94°C for two days. The cells were cooled to room 
temperature, separated from unused sulfur by decanting and collected by centrifugation and stored at -70°C. 
The yield of cells was 1.4 g per liter. 

20 1 1 .5 g of ceils obtained as described above, were suspended in 28 ml of buffer A (1 0 mM KP04 buffer, pH 

7.4; 0.1 mM EDTA, 1.0 mM beta-mercaptoethanol) containing 0.1 M NaCI and sonicated for 5 minutes at4°C. 
The lysate was centrifuged at 15,000 g for 30 minutes at4°C. The supernatant solution was passed through 
a 18 ml Affigel blue column (Biorad). The column was then washed with 50 ml of buffer A containing 0.1 M NaCI. 
The column was eluted with a 300 ml linear gradient from 0.1 to 2.0 M NaCI in buffer A. The DNA polymerase 

25 eluted as a single peak at approximately 1.3 M NaCI and represented 90% of the activity applied. The peak 
activity of DNA polymerase (25 ml) was dialyzed against 1 liter of buffer A containing 100 mM NaCI, and then 
applied to 15 ml Phosphocellulose column, equilibrated with buffer A containing 100mM NaCI. The column was 
washed with 50 ml of buffer A containing 100 mM NaCI, and the enzyme activity was eluted with 200 ml linear 
gradient of 0.1 to 1 .0 M NaCI in buffer A. The activity eluted as a single peak at 0.6 M NaCI and represented 

30 70% of the activity applied. The pooled activity (42 ml) was dialyzed against 500 ml of buffer A and applied to 
a 25 ml DEAE column. The column was washed with 50 ml of buffer A containing 0.1 M NaCI, and two-thirds 
of the enzyme activity passed through the column. The active fractions were pooled (30 ml) and applied to an 
1.0 ml HPLC mono-S column (Pharmacia) and eluted with a 100 ml linear gradient in buffer A from 0.05 to 1 .0 
M NaCf. The activity eluted as a single peak at 0.22 M NaCI and represented 80% of the activity applied. 

35 Purified Pyrococcus sp. polymerase was electrophoresed in SDS 10-20% polyacrylamide gel and stained 

with either Coomassie Blue or the colloidal stain (ISS Problue) previously described to detect protein. A faintly 
staining protein band was seen at about 92,000 to 97,000 daltons; this molecular weight determination was 
obtained by comparison on the same gel to the migration of the following marker proteins (Bethesda Research 
Laboratories): myosin, 200,000 daltons; phosphorylase B, 97,400 daltons; BSA, 68,000 daltons; ovalbumin, 

40 43,000 daltons, carbonic anhydrase 29,000 daltons; b-lactoglobulin, 1 8,400 daltons; lysoyzme 14,300 daltons. 

EXAMPLE XIII 

CLONING OF PYROCOCCUS SPECIES DNA POLYMERASE GENE 

45 - 

Cross hybridization of a Pyrococcus genomic DNA library using radioactive probes prepared from the DNA 
polymerase gene of 7: litoralis allowed for the identification and isolation of a DNA encoding the Pyrococcus 
DNA polymerase. This was accomplished as set forth below. 

In order to determine which restriction enzymes would be most useful in preparation of the Pyrococcus gen- 

50 omic library, Pyrococcus sp. DNA was cut to completion with Eco Rl, BamHI and Hindlll. This DNA was subject 
to agarose gel electrophoresis (Figure 13A) and Southern hybridization (Figure 13B) using a DNA probe pre- 
pared as follows. A reaction mixture containing 1 ng of the first EcoRI fragment of the T. litoralis DNA polymerase 
gene (bp 1-1274, obtainable from bacteriophage NEB#618, ATCC No. 40794) as a template in a commercial 
random priming kit (New England Biolabs, Inc.) was incubated for 1 hour at 37°C to produce a DNA probe of 

55 high specific activity. The probe was hybridized to Pyrococcus sp. DNA prepared above under moderately strin- 
gent conditions (Hybridization: overnight at 50°C, 4X SET, 0.1M sodium phosphate, pH 7, 0.1% Na pyrophos- 
phate, 0.1% SDS, 1X Denhardts solution; Wash Conditions: wash 3X20-30 min. 45°C, 0.1XSET, 0.1M sodium 
phosphate, (pH 7), 0.1% Na pyrophosphate, 0.1% SDS. Maniatis, et al., supra). A single major band at about 

30 
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5 Kb was detected in BamH I cut Pyrococcus DNA. EcoR I and Hind III gave multiple bands with this probe, 
indicating that these enzymes cut within the Pyrococcus polymerase gene. 

Based on the results, a BamH I genomic library was constructed using the phage vector XDASH (Strata- 
gene). Partial and complete BamHI digests of Pyrococcus DNA were prepared. A mixture of the partial and com- 

5 pletely BamHI digested DNA was ligated into the BamHI site of XDASH. The ligation mixture was packaged 
using Gigapack Gold (Stratagene) according to manufacturer's instructions and plated on £ coli ER1458. The 
packaged phage library contained 1 x 10 8 phage per ml. 

32 P-labelled DNA probes of the 3 fragments (bp 1-1274, 1656-2660 and 3069-3737) of the T, litoralis DNA 
polymerase gene (obtainable from NEB#61 9, ATCC No. 40795) were prepared using a random primer kit (New 

w England Biolabs, Inc.). The probes were used according to the method of Benton & Davis (Maniatis, et al. supra) 
to screen the Pyrococcus genomic library using hybridization conditions described above. About one per cent 
of the plaques were positive and ten positive plaques were picked and purified by reinfection and replating 3 
times (until 90-100% of the plaques were positive for each isolate). Large amounts of phage were prepared 
from each isolate and used to infect £ coli cultures. Specifically, plate lysates (Maniatis et al., supra) of phage 

15 were prepared from each isolate and used to infect £ coli cells. 0.1 ml of each plate lysate was mixed with £ 
coli with 0.2 ml of cells (OD6oo=2). The bacterial cells were harvested just before lysis and suspended in 0.05 
M NaCI, 0.01 M Tris (pH 8.0), 0.1 mM EDTA, 0.1% Triton X-100 and 200 ng/ml lysozyme (3 volumes per volume 
of cells) and heated to 37°C for about 1 minute or until cell lysis occured. The lysed extracts were immediately 
heated at 75°C for 30 minutes, centrifuged and the supernatant solution assayed for heat stable DNA polymer- 

20 ase activity, according to the method described above. Three of the ten isolates showed significant polymerase 
activity and the clone (B9) showing the most activity was investigated further. 

The phage DNA was isolated from B9 and the insert DNA was examined by restriction enzyme digestion. 
Digestion with Sal I gave the expected two arms of XDASH plus a 15 Kb insert. Digestion with BamH I gave 
the two arms of XDASH plus three insert fragments of 7, 4.8 and 3 Kb. Each of these fragments were purified 

25 by agarose gel electrophoresis, eluted and ligated into the BamH I site of pUC19. The ligation mixture was used 
to transform £ coli ER2207 which gives white colonies when plasmids contain an insert and blue colonies with 
no inserts on indicator agar media (X-gal plus IPTG). No white transformants were obtained with the 7 Kb frag- 
ment Three whites and twenty-seven blue transformants were obtained with the 4.8 Kb fragment and twenty 
white and twenty-one blue transformants were obtained with the 3 Kb fragment All three 4.8 Kb white colony 

30 transformants expressed heat stable DNA polymerase activity. None of the transformants with the 3 Kb frag- 
ment expressed heat stable polymerase activity. The three clones carrying the 4.8 Kb Pyrococcus DNA frag- 
ment all had about the same specific activity for heat stable DNA polymerase and one was picked for further 
study (NEB#720). This clone designated NEB#720 was deposited with the American Type Culture Collection 
on October 1 , 1991 and bears ATCC No. 68723. A restriction endonuclease map of the 4.8 Kb BamH I fragment 

35 containing the Pyrococcus sp. DNA polymerase gene is shown in Figure 14. A partial DNA nucleotide sequence 
coding for Pyrococcus sp. DNA polymerase (NEB720) is set forth in Figure 18, including the start of the poly- 
merase gene at bp 363 and a portion of the intervening nucleotide sequence (bp 1 839-3420). NEB#720 yielded 
1700 units of DNA polymerase activity per gram of cells and was used for the large scale preparation of this 
enzyme. 

40 A portion of the Pyrococcus sp. DNA polymerase clone has been sequenced (Fig. 18, bp 1-3420). The se- 

quence of the Pyrococcus sp. DNA polymerase is very similar to the 7: litoralis DNA polymerase at both the 
DNA and protein level (similarity calculated using the GCG Bestfit Program, Smith and Waterman, Advances 
in Applied Mathmatics, 2:482 (1981)). Overall, the genes are 66% identical, with 69% identity in the mature 
DNA polymerase amino termini regions (bp 363-1838 in Pyrococcus sp. DNA polymerase) and 63% identical 

45 - in the portion of IVS1 sequenced to date (bp 1839-3420 in Pyrococcus sp. DNA polymerase). The upstream 
regions (bp 1-362 in Pyrococcus sp. DNA polymerase, Fig. 18 and bp 1-290 in T. litoralis DNA polymerase, 
Fig. 6) show no similarity according to the Bestfit Program. 

Similarity at the protein level is even higher. In the 1019 amino acid Pyrococcus sp. DNA polymerase coding 
region, the two polymerases have 83% similarity and 68% identity (Fig. 1 9). When broken down into the mature 

so polymerase amino terminus and IVS1 , the polymerase coding exons are more similar than the intervening se- 
quence, with the mature polymerase amino termini (aa 1-492 in Pyrococcus sp. DNA polymerase) being 89% 
similar, and 78% identical, and IVS1 (aa 493-1019 in Pyrococcus sp. DNA polymerase) being 78% similar and 
60% identical. 

55 
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EXAMPLE XIV 

ARCH AE BACTERIA DNA POLYMERASE SIMILARITIES AT THE DNA LEVEL 

5 The degree of cross-hybridization between the T. litoraiis DNA polymerase gene and the DNA polymerase 

genes from 3 other thermophilic archae bacteria and from Taq DNA was assessed by Southern blot hybridi- 
zation (Maniatis, supra). Chromosomal DNA from T. litoraiis and Pyrococcus sp. (Strain GB-D), 7: aquaticus, 
and two other Pyrococcus strains, G-1-J and G-1-H, were cleaved with either EcoRI or BamHI. 5 of each 
DNA was incubated with 1x NEBuffer (EcoRI buffer for EcoRI endonuclease and BamHI buffer + 1X BSA for 

10 BamHI endonuclease) in a total volume of 60 uJ with 20 units of EcoRI endonuclease or 20 units of BamHI en- 
donuclease for 2 hours at 37°C. Four quadruplicate 0.75ug samples of each of the cleaved DNAs were loaded 
and run on a 1% agarose (SeaKem LE) gel in Tris Acetate EDTA buffer (Maniatis, supra). The gel was stained 
with Ethidium Bromide (f u.g//r)/) for 20 minutes at room temperature and a photograph taken with a ruler besides 
the gel. 

is The DNA was transferred from the gel onto nitrocellulose paper using the method developed by Southern 

(Maniatis supra). Nitrocellulose filter paper (0.45u/n) was cut to the size of the gel and soaked in 200ml of 6x 
SSC (0.9M NaCI, 0.09M Sodium Citrate) for greater than 1 hour at 37°C. Meanwhile, the gel was incubated 
for 15 minutes in 200 ml of 0.25M Hydrochloric acid at room temperature, then rinsed with distilled water. The 
gel was then incubated for 30 minutes in 200 ml 0.5M Sodium Hydroxide, 1M Sodium Chloride at room tem- 

20 perature, then rinsed with distilled water. The gel was then incubated for 30 minutes in 200-mIs 1M Tris HCI, 
pH7.5, 3M Sodium Chloride at room temperature. Transfer of the DNA from the gel onto the nitrocellulose was 
carried out at 4°C in 18X SSC (2.7M Sodium Chloride, 0.27M Sodium Citrate), 1M Ammonium Acetate. After 
6 hours the nitrocellulose was removed and washed in 1x SSC (0.1 5M Sodium Chloride and 0.01 5M Sodium 
Citrate) for 30 seconds. The nitrocellulose filter was air dried and then vacuum dried at 80°C for a further 2 

25 hours and then stored at room temperature. 

Four gel purified fragments of T. litoraiis DNA polymerase DNA, (1.3 kb Eco Rl fragment from bp 1-1274 
representing the 5* polymerase coding region; bp 471 8-5437, representing the 3' polymerase coding region; 
bp 2448-2882, representing part of IVS1 ; and bp 3666-4242, representing part of IVS2, Figures 6 and 1 5) were 
radiolabeled using the New England Biolabs Random Primer Kit. 100ng of the above template DNAs, each in 

30 a volume of 35.5 u.l, were boiled for 5 minutes in a boiling water bath and then cooled on ice for 5 minutes and 
spun down. The template DNAs were incubated with 1X labelling buffer (includes random hexanucieotides), 
1/10 volume dNTP mix, 25uCi a 32 P dCTP and 5 units DNA Polymerase l-Klenow fragment in a total volume of 
50nl for 1 hour at 37°C. The reactions were stopped with 0.0 18M EDTA. The probes were purified using an 
Elutip minicolumn (Schleicher and Schuell) following the manufacturers recommended elution conditions. The 

35 total number of counts were calculated for all purified probes. The 1.3 kb Eco Rl fragment probe (bp 1-1274) 
yielded 24 x 10 6 cpm, the 3' polymerase probe (bp 4718-5436) yielded 22 x 10 6 cpm, the IVS1 probe yielded 
54 x 10 6 cpm, and the IVS2 probe yielded 47 x 10 6 cpm. 

Hybridization was carried out as follows (Maniatis supra). The nitrocellulose filter was incubated for 30 min- 
utes in 5mls prehybridization buffer (0.75M Sodium Chloride, 0.15M Tris, 10 mM EDTA, 0.1% Sodium Pyro- 

40 phosphate, 0.1% Sodium Lauryl Sulphate, 0.2% Bovine Serum Albumin, 0.2% Ficoll 400, 0.2% PVP and 100 
ng/ml boiled calf thymus DNA) at 50°C. Each nitrocellulose filter was then placed in separate bags with 5mls 
hybridation buffer (as above except 0.03% Bovine serum albumin, 0.03% Ficoll 400, and 0.03% PVP). Each 
section was hybridized with 22-25 x 10 e cpm of denatured probe overnight at 50°C. 

The nitrocellulose filters were removed from the bags and incubated 3 x 30 minutes with 0.1X SET Wash 

45 - (l5mM NaCI, 3mM Tris base, 0.2 mM EDTA, 0.1% SDS, 0.1% Sodium Pyrophosphate and 0.1 M Phosphate 
Buffer) at 45°C. The filters were kept moist, wrapped in Saran Wrap and exposed to X-ray film for various times 
ranging from 4 hours to 3 days. 

The results are shown in Figure 16. In Figure 16, parts A through D are autoradiographs of quadruplicate 
Southern blots. Lanes 1-5, DNA cut with EcoRI. Lanes 6-10, DNA cut with BamHI. Lanes 1 & 6, Pyrococcus 

50 S p. G-1-J DNA; Lanes 2 and 7, Pyrococcus G-1 -H DNA; Lanes 3 & 8, T. litoraiis DNA; Lanes 4 and 9, Pyrococcus 
sp, GB-D DNA, Lanes 5 & 10, T. aquaticus DNA. The hybridization probes are as follows: part A, 5' coding region 
of T. litoraiis DNA polymerase gene, bp 1-1274; part B, 3' coding region of T. litoraiis DNA polymerase gene, 
bp 4718-5437; Part C, partial IVS2 probe, bp 3666-4242; Part D, partial IVS1 probe, bp 2448-2882. The upper 
and lower panels of parts C and D represent shorter and longer exposures, respectfully, of the same blots. 

55 None of the 4 probes hybridized to Taq DNA. Both polymerase coding region probes hybridize to specific 

bands in all Thermococcus and Pyrococcus DNAs, but not Taq DNA. Good signals were obtained with both 
probes indicating strong conservation of both the amino and carboxy terminal ends of the T. litoraiis DNA Poly- 
merase coding region. The amino terminal regions of T. litoraiis and Pyrococcus sp. GB-D are about 69% iden- 
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tical (see, e.g. Figs. 6 and 1 8) and very similar at the protein level (Fig. 1 9). The I VS1 probe hybridized strongly 
to 7". Iitoralis and Pyrococcus sp. GB-D DNAs (about 63% identical over a 1582 bp region) and weakly to Pyr- 
ococcus sp. G-1-H DNA. The IVS2 probe hybridized strongly to T. Iitoralis DNA and weakly to Pyrococcus sp. 
G-1-H DNA. 

EXAMPLE XV 

ARCH AE BACTERIA DNA POLYMERASE SIMILARITIES AT THE ANTIBODY LEVEL 

Pellets from 1 ml cultures of T. Iitoralis, and Pyrococcus strains were resuspended in 100uJ Urea lysis buffer 
(4M Urea, 0.12M Tris, 4% Sodium Lauryl Sulphate, 10% (J-mercaptoethanol, 20% glycerol and 0.002% Bro- 
mophenol Blue) and boiled for 3 minutes. The boiled samples were sheared with 25G5/8 needle to reduce the 
viscosity of the samples. Duplicate 10uJ samples of T. iitoralis, and Pyrococcus strains G-1-J and G-1-H, and 
also samples of purified Taq DNA polymerase, E. coli DNA polymerase and purified DNA polymerase from Pyr- 
ococcus sp. (GB-D) were loaded onto 10-20% SDS-PAGE gels and run in Protein Running Buffer (0.1 % Sodium 
Lauryl Sulphate, 0.19M Glycine, and 0.025M Tris Base). Nitrocellulose filters (45u.m) were soaked in distilled 
water for 5 minutes and then soaked in Transfer buffer (0.15% ethanolamine, 20 mM Glycine and 20% Metha- 
nol) for 30 minutes. The protein on the gels were electroeluted (30 volts, overnight at 4°C) onto the nitrocellulose 
filters in Transfer buffer (Towbin, et al. PNAS (1979) 76:4350-4354). 

The nitrocellulose was removed, marked with a ball point pen and washed for 5 minutes in TBSTT (20 mM 
Tris, 150mM Sodium Chloride. 0.2%Tween 20, and 0.05% Triton X-100). The filters were blocked for 30 minutes 
in TBSTT + 3% nonfat dry milk (Carnation), and washed 3x3 minutes in TBSTT. The anti-7! Iitoralis DNA poly- 
merase antisera was raised against a partially purified native DNA polymerase preparation. T. Iitoralis DNA poly- 
merase specific sera was prepared by affinity purification on Western blot strips of purified native enzyme (Beall 
et al., J. Immunological Methods86:21 7-233 (1983)). Affinity purified anti-T. Iitoralis DNA polymerase mouse 
antibody (V76-2+3) and monoclonal anti-Taq polymerase antibody (diluted 1:100 in TBSTT) were added sep- 
arately to each nitrocellulose filter for 5 hours at room temperature. The filters were washed 3x3 minutes with 
TBSTT and then reacted with a 1:7500 dilution of anti-mousse secondary antibody conjugated with alkaline 
phosphatase (Promega) in TBSTT for 1 hour at room temperature. The nitrocellulose filter was developed with 
NBT/BCIP as instructed by the manufacturers (Promega). The results using Taq monoclonal are shown in Fig- 
ure 17. Figure 17 is a Western blot of crude lysates from T. Iitoralis (V), Pyrococcus sp. G-1-J (J), and Pyro- 
coccus sp. G-1-H (H), or purified polymerases from Pyrococcus sp. GB-D (DV), T. aquaticus (T) or E. coli (E) 
reacted with affinity purified anti-7: Iitoralis DNA Polymerase antibody in Part A or anti-Taq DNA polymerase 
monoclonal antibody in Part B. The arrow indicates the position of the T. Iitoralis and Pyrococcus sp. DNA Poly- 
merase proteins. The reactivity in Part B is to background proteins and not to the DNA polymerases as seen 
in part A. 

Monoclonal antibody specific to Taq DNA polymerase does not cross-react with protein form the Pyrococ- 
cus and Thermococcus strains tested. 

However, the 90-95,000 dalton DNA polymerase proteins from T. Iitoralis and the 3 Pyrococcus strains re- 
acted with the affinity purified anti-T. Iitoralis DNA polymerase antibody. This is not surprising, considering the 
high degree of both similarity and identity between T. Iitoralis and Pyrococcus sp. GB-D DNA polymerases (Fig 
19). 

Figure 19 is a comparison of a portion of the deduced amino acid sequences of recombinant T. Iitoralis 
and the partial sequence of recombinant Pyrococcus sp. DNA polymerase. The Pyrococcus DNA polymerase 
deduced amino acid is listed on the upper line, and the deduced amino acid sequence of recombinant 7: Iitoralis 
DNA polymerase is listed on the lower line. Identities are indicated by vertical lines, similariteies are indicated 
by 1 or 2 dots, nonconserved substitutions are indicated by blank spaces between the two sequences. 

EXAMPLE XVI 

In order to obtain recombinant thermostable DNA polymerase from a target archaebacterium, several basic 
approaches to cloning the target DNA polymerase gene can be followed. Initially, one attempts to determine 
immunologically whether the new polymerase is a member of the Pol a or Pol ! family by Western blot analysis 
of purified polymerase (although crude polymerase lysates may work with reduced sensitivity) using anti-Taq 
DNA polymerase or anti- T. Iitoralis DNA polymerase sera, as described in Example XV of this invention (Figure 
17). If the new polymerase reacts with anti-Taq Polymerase monoclonal, then it probably cannot be easily 
cloned using reagents generated from T. Iitoralis DNA Polymerase. If the new polymerase cross-reacts with 
anti-7: Iitoralis sera, then one should be able to clone it with the following procedures. If the new polymerase 



33 



EP 0 547 920 A2 

fails to react with either sera, then the experiment is considered inconclusive and one should go onto the next 
step, DNA cross-hybridization. 

Optimum probes and DNA hybridization conditions must be experimentally determined for each new or- 
ganism. At the same time, various restriction digests of DNA from the new organism are tested in order to find 
5 enzymes which yield fragments which hybridize to the 7". litoralis probe and are large enough to encode the 
new polymerase. 

Probe selection can vary with respect to size and regions of the T. litoralis DNA Polymerase gene. Optimum 
probes can be determined by performing test Southern blots as described below with large or small DNA frag- 
ments, or even oligomers. One could select probes that are totally from within the IVS sequences to look for 

10 the presence of IVSs in new archaebacterium DNA polymerase genes, or probes could be limited to mature 
polymerase coding regions. Using the entire T. litoralis DNA Polymerase gene region as the probe has several 
advantages and disadvantages. The major disadvantage is that the larger the probe, the more likely to yield 
spurious hybridization at very low stringency. Among the advantages of using larger probes are (1) they are 
more likely to cross-hybridize to another polymerase which may have diverged greatly from the T. litoralis DNA 

15 Polymerase gene in one small portion of the polymerase, and (2) they are more likely to detect internal restric- 
tion sites in the new polymerase gene since the probe spans the amino- and carboxy-termimi of the T. litoralis 
DNA Polymerase gene. It is important at the initial stages of probing to use several restriction enzymes to cleave 
the DNA from the new archaebacterium to find one or more enzymes which yield preferably one, or possibly 2 
bands, which hybridize to the 7* litoralis DNA Polymerase probe and which are large enough to encode the 

20 new polymerase. The minimum coding sequence required for the new polymerase can be estimated from the 
size of the new polymerase determined by Western blots (assuming a factor for IVSs, if desired) or, by guessing 
at greater than 4 KB as a first approximation. Maximum fragment size is limited by the cloning capacity of the 
desired vector. 

Optimum hybridization conditions are experimentally determined by performing test Southern blots at va- 

25 rious wash temperatures. Hybridization is earned out at 50°C in 4X SET, 0.1 M sodium phosphate, pH 7, 0.1% 
Na pyrophosphate, 0.1% SDS, 1X Denhardts solution, although any low stringency hybridization condition 
would also be suitable (Maniatis). Wash conditions are varied from 37-55°C, 3 x 30 minutes with 0.1X SET 
wash (15mM NaCl, 3mM Tris base, 0.2mM EDTA, 0.1% SDS, 0.1% Sodium Pyrophosphate and 0.1M Phos- 
phate Buffer), although any standard low stringency wash conditions can also be used. The point of this part 

30 of the experiment is to hybridize the probe and wash the Southern blot at low stringency to insure some level 
of cross- hybridization which may even include non-specific cross-hybridization. Next, one increases the wash 
stringency, for example, increasing the wash temperature in 3-5°C increments and then monitoring the disap- 
pearance of hybridized probe as determined by a decrease in signal upon autoradiography. Initially, one ex- 
pects to see many bands hybridizing to the probe at low stringency. As the wash stringency increases, weakly 

35 hybridizing sequences melt off and disappear from the autoradiograph. As wash stringency is increased, con- 
ditions are established at which only one or a few bands still hybridize to the probe. These are the conditions 
to be used in future experiments. As stringency increases beyond this point, all hybridization signal is lost. The 
goal is to determine the most stringent condition where one or a few bands per digest still hybridize to the probe 
before all hybridization signal is lost. 

<o |f initial probing with a large T. litoralis DNA polymerase gene fragment fails to give a clear pattern using 

any hybridization conditions, then smaller probes can be tested until a good partnership of probe size and hy- 
bridization conditions are established. Alternatively, Example XIV of the present invention shows that several 
fragments spanning different regions of the T. litoralis DNA polymerase gene (amino terminus, IVS1 , IVS2 and 
carboxy terminus, Figures 1 5 and 1 6)) can be used in separate Southern blots, but tested in parallel at the same 

45 - time. 

Libraries are constructed with the optimum restriction digests and hybridized with the optimized probe. A 
parallel approach is to clone in expression vectors and directly screen with anti-I litoralis sera. Either primary 
approach may yield active or inactive product. If no active polymerase is detected, the clone is checked for 
insert size and reactivity to anti-7: litoralis sera. If there is no reactivity to anti-7*. litoralis sera, then the poly- 

50 merase may not be expressed from its own control sequences in E. coli and the plasmid insert must be se- 
quenced to operably link the new polymerase to an £. coli promoter and perhaps translation signals. 

In the present invention, we have identified introns or intervening sequences in Pol a conserved region 
motifs in both 7*. litoralis and Pyrococcus sp. DNA polymerase genes. We therefore predict that other Archae 
DNA polymerase genes may have introns in conserved motifs also. If the new polymerase clone is inactive, it 

55 should be checked for the presence of intervening sequences. These introns can be identified in 2 ways. If these 
introns are related to introns found in T. litoralis and Pyrococcus sp. DNA polymerase genes, they can be iden- 
tified by low stringency hybridization to DNA probes derived from intra n sequences of T. litoralis and Pyrococcus 
sp. DNA polymerase genes. If IVSs are found, the clone is sequenced to develop strategies for removal of the 
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IVS. If the clone is inactive and no cross-hybridizing IVSs are found, then the plasmid is sequenced to look for 
new IVSs. The archaebacterium DNA polymerase gene can be sequenced at the DNA level and the sequenced 
compared to (1) other DNA polymerases to identify non-similar segments (2) conserved motifs to look for the 
absence of Regions l-VI, followed by identification of interruption points in Regions which are absent. Once 
identified, introns can be removed in vitro by any number of techniques known in the art, some of which are 
described in this application with respect to to removal of IVS1 and IVS2 from the T. litoralis DNA polymerase 
gene. 

If the primary library screening fails to produce a clone synthesizing active thermostable DNA polymerase, 
but does result in a partial gene clone as determined by (1) cross-hybridization at the DNA level, (2) cross- 
reactivity at the antibody level, and (3) similarity to other DNA polymerases at the DNA sequence or deduced 
amino acid sequence levels, then more genomic Southern blots are probed with the initial clone to identify re- 
striction enzymes to be selected for making the next library. The second library should contain larger fragments 
which are more likely to encode the entire polymerase gene. The library is screened with either antibody or 
preferably, the initial new polymerase cloned sequence. The resultant positives are checked for thermostable 
DNA polymerase activity. If no active thermostable DNA polymerase is detected in this second round, then in- 
tervening sequences can be screened for by cross-hybridization and DNA sequencing. DNA sequencing can 
also indicate whether the cloned gene is complete by establishing the presence of all the conserved polymerase 
motifs and a stop codon in the polymerase open reading frame. Several rounds of screening and rescreening 
may be necessary before finally cloning an active thermostable DNA polymerase. 

It should also be noted that the above screening and rescreening procedure may not be sufficient for cloning 
the new thermostable polymerase gene because of toxic elements present in the gene. In this case, cross- 
reactivity at the DNA or protein level is an excellent method of cloning because only partial, inactive products 
can initially be cloned which will allow subsequent cloning of the complete gene. If obtaining the complete gene 
is not straightforward using the strategy outlined above, one should look for the presence of intervening se- 
quences like IVS2 which are very toxic when cloned. This is accomplished by either looking for deletions and 
rearrangements in polymerase clones or by probing for known toxic 7! litoralis IVS sequences. Duplicate South- 
ern blots are probed with polymerase coding regions and IVS sequences to locate toxic IVSs in proximity to 
the polymerase coding region. If rearrangements or toxic IVSs are found, then the appropriate strategy would 
be to first operably link the amino terminal of the polymerase to a very tightly controlled expression system as 
described in this present application. Once accomplished, the remainder of the polymerase gene can be cloned 
and ligated to the amino terminus, reducing expression of toxic elements such as the T. litoralis IVS2 sequence. 
Alternatively, cross-hybridizing sub-fragments of the polymerase gene can be isolated, checked for IVSs by 
hybridization or DNA sequencing, IVSs can be removed in vitro from these regions by methods known in the 
art The complete polymerase gene can then be constructed by ligation of sub-fragments from which toxic ele- 
ments have been removed. 



Claims 

1. Recombinant thermostable DNA polymerase from archaebacteria which is encoded by a DNA sequence 
which hybridizes to a nucleotide sequence selected from the group consisting of the nucleotide sequence 
of Figure 6 or a portion thereof, nucleotides 1 to 1274 of Figure 6, nucleotides 1 269 to 2856 of Figures 6 
and nucleotides 2851 to 4771 of Figure 6. 

2. The recombinant thermostable polymerase of claim 1 , wherein the portion of the nucleotide sequence of 
Figure 6 is at least about 20 nucleotides in length. 

3. The recombinant thermostable polymerase of claim 1 , wherein the portion of the nucleotide sequence of 
Figure 6 is at least about 50 nucleotides in length. 

4. The recombinant thermostable polymerase of claim 1 , wherein the portion of the nucleotide sequence of 
Figure 6 is at least about 150 nucleotides in length. 

5. Recombinant thermostable DNA polymerase from archaebacteria which hybridizes to an antibody probe 
which has antigenic specificity to T. litoralis DNA polymerase. 

6. Isolated DNA which codes for the recombinant thermostable DNA polymerase of claim 1 . 

7. A cloning vector comprising the isolated DNA of claim 6. 
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8. A host cell transformed by the vector of claim 7. 

9. A method for producing a recombinant thermostable DNA polmerase from archaebacteria comprising cul- 
turing a host cell transformed with the vector of claim 7 under conditions suitable for the expression of the 
DNA polymerase. 

10. A DNA probe which hybrid izes to the DNA sequence coding for the archaebacteria thermostable DNA poly- 
merase of claim 1 , wherein the DNA probe is selected from the group consisting of the nucleotide sequence 
of Figure 6 or a portion thereof, nucleotides 1 to 1274 of Figure 6, nucleotides 1269 to 2856 of Figure 6 
and nucleotides 2851 to 4771. 

11. The DNA probe of claim 10, wherein the portion of the nucleotide sequence of Figure 6 is at least about 
20 nucleotides in length. 

12. The DNA probe of claim 10, wherein the portion of the nucleotide sequence of Figure 6 is at least about 
50 nucleotides in length. 

13. The DNA probe of claim 10, wherein the portion of the nucleotide sequence of Figure 6 is at least about 
150 nucleotides in length. 

14. A method for isolating DNA coding for thermostable DNA polymerase from archaebacterium comprising 
the steps of: 

(a) forming a genomic library from the archaebacterium; 

(b) transforming or transfecting an appropriate host cell with the library of step (a); 

(c) contacting DNA from the transformed or transfected host cell with a DNA probe selected from the 
group consisting of the nucleotide sequence of Figure 6 or a portion thereof, nucleotides 1 to 1274 of 
Figure 6, nucleotides 1269 to 2856 of Figure 6 and nucleotides 2851 to 4771; 

(d) assaying the transformed or transfected cell of step (c) which hybridizes to the DNA probe for DNA 
polymerase activity; and 

(e) isolating a DNA fragment which codes for the thermostable DNA polymerase. 

15. A method for isolating DNA coding for thermostable DNA polymerase from archaebacterium comprising 
the steps of: 

(a) forming a genomic library from the archaebacterium; 

(b) transforming or transfecting an appropriate host cell with the library of step (a); 

(c) contacting extract from the transformed or transfected host cell with an antibody probe which has 
specific affinity for 7: litoralis DNA polymerase; 

(d) assaying the transformed or transfected cell of step (c) which is cross-reactive to the antibody probe 
for DNA polymerase activity; and 

(e) isolating a DNA fragment which codes for the thermostable DNA polymerase. 

1 6. A method for increasing the expression of a thermostable DNA polymerase from archaebacteria compris- 
ing the steps of: 

(a) identifying and locating any intervening nucleotide sequence in the isolated DNA of claim 14 or 15; 
and 

(b) removing the intervening nucleotide sequence from the isolated DNA. 

17. The method of claim 16, wherein the archaebacteria comprises T. litoralis. 

18. The method of claim 17, wherein the intervening nucleotide sequence is setected from the group of IVS1 
IVS2 or IVS1 and IVS2. 

19. The method of claim 16, wherein the intervening nucleotide sequence is identified and located with a DNA 
probe coding for an intron from the DNA sequence encoding 7: litoralis DNA polymerase. 

20. The method of claim 19, wherein the DNA probe is selected from the group consisting of a 1614 bp nu- 
cleotide sequence of Figure 6 comprising nucleotides 1776 to 3389 or a portion therof and an 1170 bp 
nucleotide sequence of Figure 6 comprising nucleotides 3544 to 4703 or a portion thereof. 

21. A thermostable endonuclease obtainable from T. litoralis which cleaves double-stranded deoxy nucleotide 
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acid pBR322 at position 164 and 2411. 
22. The endonuclease of claim 21 , having a molecular weight of about 33,000-37,000. 
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Fig. 1A. SDS-Polyacrylarnide Gel of Purified 
T. litoraiis DNA Polymerase 




Lane 1: Molecular weight markers 

Lane 2: Purified T. litoraiis DNA Polymerase 
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THERMAL STABILITIES OF DNA POLYMERASES 

FIG. 3A 
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GMITCGOm TAAAATCTAT TITCITCCTC CAXTinQA XTTCAAAAAC CTAAGCATC* 60 

GCCAAAOCTC TOSOOTTTC TCTCTCCTIC CXGCTAACCC TCITCAAAAC raCTCQM 120 

«axrrrrrc KnaMccrc Ma. w aicsgggtc* ctmmctcc wwasricG 180 

TGASGGGTEA TTCTCrAGAA CAACTCCA1G ATTTKXIATT TGGATGGGGG TIIftAMaXT 240 

TGGCGGAACT IIJKITIMI TTGAACTOT. GirDfflMCT GGTGGTATTT ATGAT7-.CTGG 300 

Aocrcaxra cajt^aaa gmggcaagc ctaimtccg aattitiwc juugaewcg 36 o 

GGGMTTTRA AKDCAACIT GPOCCTCATr TTOGOCCia TATAXATGCT CTKTCWvG 420 

KKPCTOOGC raiTOTGGM A33AA3GCAA TAWGGGGSA GMACXTCGA AAAACTCTGA 480 

GACTGCTCGA TGCAGTGAAA AAXnTTGGG AAGGGAAGTT GAAGTCTGGA 540 

MCTOWCTTT CGSGCATCCC OUUSCSTTC CVGCiaTGCG GGGCRAAATA AGGG71ACATC 600 

CMCTGTCGT TGaaOTTSC GAATAIGACA TACCCTTTGC CAAGCCTERT CTCVE\G\CA 660 

AQ3GCITCAX TCCCAIGG7-C GGSGMGAGG SGCTEWCCT C Ul l UJlT ri ' GATMTGAAA 720 

OCTITrAICa TGaGGGSGAX GAXTITGGAA ftGGGCGMYT AATMTCATT AGTTATOCCG 780 

AXGRAGMGA GGOQGAGTA XTCRCMGGA AAAAEVTCGA TITQCCGTKr GTCGRTGTIG 840 

TCTOCAAIGA AAGAGAAAXG AXAAAGOGTT TTCJITCVAGT TGTIAAAGRA AAAGAOXCS 900 

JlTGTGATAAT AACTEACAAT GGGGACAATT TTEtfTTGCC GT7VTCTCVTA AAACGGGOVG 960 

AAMGCTGGG AGTTCGGCTT GTCTTAGGAA GGSVSAAGA ACAICCCG?* OCCyVCTVTIC 1020 

AGSGGATCGG TGKEW3ITJLT GCTGTGGAAA TCAAGGGdG AATCCRCTTT GAICTmCC 1080 



FIG. 6-1 



45 



EP 0 547 920 A2 



<=X^^CG AAGGAOGA^ AACCKXX^A CGI^ACGCT ^^CCTT ^ G 
Tm*GGAAA AAOCAAAAGC AAAXTAGGAG CAGAGGAAAT TGCCGCT7\T7\ TGGGAAAG\G 
AAGAAAGCAT GAAAAAACTA COX^CT CAA^GAAGA TGCTAGGGCA AC^T^GC 
^C^AGGA ATICITOOOC ATCGAAGCTG AC^GGCAAA GOGmGGT CAAAGTGT7VT 
GGGAOCTCTC GAGATCAAGC AO03GCTVACC TOTIGGAGIG GTATCTTITA AGGGTGGCAT 
ACSGGAGGAA TGAACTIGCA COGAAC^AC CTGAXGAGGA AGAGTATAAA CGGCGCTTAA 
CTAC ^™ CCTGGGAGGA TATCI^AAAG AGCCAGAAAA AGGTTTGTGG GAAAATATCA 
TTTATITGGA TTTOCGCAGT CTGTACCCIT CAATAATAGT ^CICACAAC GI^TCCCCAG 
M^OCCXTCA AAAAGAGGGC TtHAAGAATT ACGATGTTGC ICCGA^CIA GGA17YI7\GGT 
TCTGCAAGGA CTTTOOGGGC TITATrCCCT CCATACKXG GGACTTAAIT GCAAT^GGC 
AAGAIATAAA GAAGAAAATG AAATCCACAA TTGACCCGAT CX^AAAGAAA ATGCTCGATT 
MMGCAAAG GGCXWTftaA TTGCITGCAA ACAGCATCTT ACCCAACGAG TGGTTACCAA 
TAATTGAAAA TCGAGAAA2* AAATTCGPGA AAA1TCGCGA GITIATAAAC TCTTACATGG 
AAAAACAGAA GGAAAACCTT AAAACAGTAG AGAATACTGA AGTTCTOGAA GTAAAC^CC 
nTITGCATr CICArrCAAC AAAAAAATCA AAGAAAGIGA ACTCAAAAAA GICAAAGCCC 
TCATAAGACA TAAGTATAAA GGGAAAGCTT ATGAGATTCA GCTTAGCTCT C^C^AAA 2040 
ITAACATAAC TGCIt^CCAT ACTCTGTTTA QGTTAGAAA TGGAGAAATA AAGGA^CTXTT 2100 
CTCGAGATGG GATAAAAGAA GGIGACCTTA TrUZAGCACC AAAGAAAATT AAACTOATX3 2160 
AAAAAGGGGT AAGCATAAAC ATTOCOGAGT TAATCTO\GA TCTTTCCGAG GAAG7vw\CAG 2220 
OCGACAITGT GATGACGAXT TCAGCCAAGG GCAGAAAGAA CTTCTTTAAA GGAATGCTGA 2280 
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GMc-x-ri*ftG CTGGAlVm' GGAGAAGAAA AIAGAAGGAT AAGAACA1TT AATCGCIMT 2340 

TCTicmrcr cgaaaaacta ggccitatca aaciactgcc ccgcggaiat gaagttacig 2400 

ACTGGGAGAG ATIAAMAAA TAXAAACAAC TTEAOGAGAA GdTGCTGGA AGCGTIAAGT 2460 

ACAACGGAAA CAAGAGAGK TAXITAGTAA TCTTCAACGA GATCAAGGAT TTIMATCXT 2520 

ACTTCCCACA AAAAGAGCTC GAAGAATGGA AAATTCGAAC TCTCAATGGC TTTAGAACGA 2580 

ATTGIATTCT CAAAGTOGAT GAGGATTITG GGAAGCTOCT AGGTTACTAT GTTAGTGAGG 2640 

GCTATGCAGG TGCACAAAAA AATAAAACTG GTGGTATCAG TTATTOGGTG AAGCTTTACA 2700 

ATCAGGACCC TAATGTTCTT GAGAGCATGA AAAATGTTGC AGAAAAAXTC TTTCGCAAGG 2760 

TTAGAGTTGA CAGAAATTGC GEAAGTA1AT CAAAGAAGAT GGCATACTTA GITATGAAAT 2820 

GCCTCICTGG AGCATTAGOC GAAAACAAGA GAATTCCITC TGTTMACTC ACCTCTOCOG 2880 

AACCGGTACG GTGGTCVITT TTAGAQGCGT ATTTE?£AGG CGAXGGAGAT ATACATCCAT 2940 

CAAAAAGGTT T?GGCTCTCA ACAAAAAGCG MCTCCTGC AAATCAGdT GTGTTCtTGC 3000 

TGAACTCTTT GGGAAIAXCC TCTGTAAAGA TAGGCITTGA CAGTGGGGTC TATAGAGTGT 3060 

AIAIAAATGA AGAOCTGCAA TTTCCACAAA CGTCTAGGGA GAAAAACACA TACTACTCTA 3120 

ACTTAATTCC CAAAGAGATC CTTAGGGACG TGTTTGGAAA AGAGTTCCAA AAGAACATGA 3180 

CGTTCAAGAA ATTIAAAGAG CTTGTTGACT CTGGAAAACT TAACAGGGAG AAAGCC\AGC 3240 

TCTTGGAGTT CTTCATTAAT GGAGATATTG TCCTTGACAG AGTCAAAAGT GTTAAAGAAA 3300 

AGGACTATGA AGGGTAXGTC TATGAOCTAA GCGTTGAGGA TAAOGAGAAC TTTCTTGTTG 3360 

GTriTGCTIT GCLUIATGCT CACAACAGCT ATTACGGCIA TATGGGGTAT CCTAAGGCAA 3420 

GKTGGXPCTC GAAGGAATGT GCTGAAAGCG TTAOCGCATG GGGGAGACAC TT^CATAGAGA 3480 

FIG.6-3 
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ACAWXAGM GAAAAGTICG GCXTTAAGCT TCITTATGOG GAQVGTGTCT 
T^I^ AIAAGGCAAA AOGGAAAGAT TMnTTTGIG AAAAI7VAAGG 
^TTICTC OMGGIGGAC XAQGCA1TG GCGAAAAAGA ATACIGCATT CltGA^SCIG 
TTCAAGCACT AACICIGGAC GATCAGGGAA AGCTItalCTG GAAGCCCGTC OCCTACGTGA 
*S*GCACAG ^OGAAXAAA AGAATGTTOC GCATCTGGCT GACCAAC*GC TGCnMMM 
ATGTTACTGA GGATCATTCr CTCATAGGCT A^I^AAC^ GICAAAANS AAAACIGCCA 
AAAAAATCGG GGAAAG^ AAGGAAGTAA AGCCTITIGA AXTAGGCAAA GC^AAAT 
CX3CTCATATG COIAAATCCA COmMKG ATCAGAAIAC CAAAACTAGC GAAATTVGC^jG 
TAAAATICTG GGATTGATTG TAGGAGAI^ AAACTGGGGT GGAGATTCTC 

OTGGGC^GA CH^TATCIT GGACITICAA CAGGCAAAGA TQCAGAAGAG ATAAAGCAAA 
MCITCIOGA AOOOC^ 

AMGGGACIT CAACATCITG GCAAAGAGCC TTGTAAAGTT TAXGAAAAGG QCTIT^ 
MGAAAAAGG AAGAOGAAAA ATTOCAGAGT TC?iTGTATGA GCITOCGGTT ACTDVOTas 
MGCATITCr ACCTOTCTG TTTTOGCTG ATGGTACTGT AACTMOGG AAGGGAGTTC 
Q^GATCAG GCEAAQAAC ATTGATGCTG ACTTTCTAAG GGAAGTAAGG AAGCTTCTGT 
GGATTGTCGG AATITCAAAT TCAATATTTG CIXMACTAC TCXZAAATOGC T^AATGCIG 
xi-ivi-ACTGG AAOCTACTCA AAGCATCTAA GGATCAAAAA TAAGTGGOGT TTTGCT3AAA 
GGATAGGCTT TTTAATCGAG AGAAAGCAGA AGAGACTTTT AGAACATTTA AAATC-GCCA 4S60 
GGCTAAAAAG GAAXACCATA GATriTGGCT TTGATCTICT GCAIGTGAAA AAAGTCGAAG 4620 
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AGAIAOCAI* CGAGGCT3*C GTTmiGACA TTGAAGICGA AGAGACGCAT AGGTTCTTTG 4630 

CAAACAACAT CCTGCTACAC AAXACTGACG GCTTTIATGC CACAAmCCC GGGGAAAAGC 4740 

CICTACTCAX 1MAAAGAAA GCCAAGGAAX TOCTAAACI3V CAXAAACICC AAACITCCM 4800 

^ICIGCITGA GCTTGAGTAT GAGGGCnTT ACITCAGAGG A1TCTTTGTT ACAAAAAAGC 4860 

GCHMGCAGT CAXAGATGAA GAGGGCAGGA TAACAACAAG GGGCITCGAA GTAGTAAGGA 4920 

GAGATTGGAG TGAGATAGCT AAGGAGACTC AGGCAAAGGT TTTAGAGOCT AI7VCT-7AAG 4g80 

AGGGAAGTCT TGAAAAAGCT GTAGAAGTTG TTAGAGATGT TCTAGAGAAA ATAGCAA?AT 5040 

ACAGGGITCC ACTTGAAAAG CTTGTTATCC ATGAGCAGAT TACCAGGGAT TTAAAGGACT 5100 

J^VAAGOCAT TCGCCCTCAT GTCGCGATAG CAAAAAGACT TCCCGCAAGA GGGATAAAAG 5160 

TGAAACCGGG CAC\AIAAIA AGCTATATCG TTCTOAAGG GAGCGGAAAG ATAAGCGATA 5220 

GGGTAAXTCT ACITACAGAA TACGATCCTA GAAAACACAA GXACGATCCG GACTACTACA 5280 

TAGAAAACQ ACTTTTGCOG GCAGTACTTA GGA33CTCGA AGCGTTTGGA TOCAGAAAGG 5340 

AGGATTTAAG CTATCAAAGC TCAAAACAAA CCGGCITPGA TGCATGGCTC AAGT-JGGTAGC 5400 

TCTGITGCTT TTTACTOCAA GTTTCTCCGC GAGTCTCTCT ATCTCTCITT TtJIATICIXSC 5460 

TATCTGGTTT TCATICACTA TTAAGTAGTC CGCCAAAGCC AXAACGCTTC CAATTCCAAA 5520 

CTTGAGCTCT TTCCAGTCTC TGGCCTCAAA TTCACICCAT GTTTTTGGAT GGTCGCITCr 5580 

CCCTCTTCTG CTAAGCCTCT CGAATCTm TCTTGGCGAA GAGTGTACAG CTATCATGAT 5640 

TATCTCTTCC TCTGGAAACG CATCTTTAAA OGTCTGAATT TCATCTAGAG ACCTCACICC 5700 

GTCGAXTATA ACTGCCTTGT ACTTCTTTAG TAGITCITIT ACCTTTGGGA TCGTTAATTT 5760 

TGCCAOGGCA TTGTCOCCAA GCTCCTGCCT AAGCTGAATG CTCACACTGT TCATACCTTC 5620 

GGGAGTTCTT GGGATCC 5837 

FIG.6-5 
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Pyrococcus sp. DNA Polymerase Gene Region 
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FIGURE 16 (cone.) 
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GGATCCCTCTCTTTTTGGTAACCCCATACGTCATTCCCTCAACCAAAACT 
TCAGCATCGTTGCAGTGGTCAGTGTGTCTGTGGGAGATGAAGAGGACGTC 
GATTTTTCTGGGGTCTATCTTGTATCTCCACATTCTAACTAACGCTCCAG 
GCCCAGGATCAACGTAGATGTTTTTGCTCGCCTTAATGAAGAAGCCACCA 
GTGGCTCTTGCCTGCGTTATCGTGACGAACCTTCCACCACCGCCACCGAG 
AAAAGTT AT CTCT AT C ATCTC AC AC CTCCCCC AT AACATCACCTGCTC A A 
TTTTTAAGCGTTCTTAAAGGCTTAAATACGTGAATTTAGCGTAAATTATT 
GAGGGATTAAGTATGATACTTGACGCTGACTACATCACCGAGGATGGGAA 
GCCGATTATAAGGATTTTCAAGAAAGAAAACGGCGAGTTTAAGGTTGAGT 
ACGACAGAAACTTTAGACCTTACATTTACGCTCTCCTCAAAGATGACTCG 
CAGATTGATGAGGTTAGGAAGATAACCGCCGAGAGGCATGGGAAGATAGT 
GAGAATTATAGATGCCGAAAAGGTAAGGAAGAAGTTCCTGGGGAGGCCGA 
TTGAGGTATGGAGGCTGTACTTTGAACACCCTCAGGACGTTCCCGCAATA 
AGGGATAAGATAAGAGAGCATTCCGCAGTTATTGACATCTTTGAGTACGA 
CATTCCGTTCGCGAAGAGGTACCTAATAGACAAAGGCCTAATTCCAATGG 
AAGGCGATGAAGAGCTCAAGTTGCTCGCATTTGACATAGAAACCCTCTAT 
CACGAAGGGGAGGAGTTCGCGAAGGGGCCCATTATAATGATAAGCTATGC 
TGATGAGGAAGAAGCCAAAGTCATAACGTGGAAAAAGATCGATCTCCCGT 
ACGTCGAGGTAGTTTCCAGCGAGAGGGAGATGATAAAGCGGTTCCTC AAG 
GT G AT AAG G G AG AAAG AT CCC G ATGT T AT AAT T ACCT AC AAC GG C G ATT C 
TTTCGACCTTCCCTATCTAGTTAAGAGGGCCGAAAAGCTCGGGATAAAGC 
TACCCCTGGGAAGGGACGGTAGTGAGCCAAAGATGCAGAGGCTTGGGGAT 
ATGACAGCGGTGGAGATAAAGGGAAGGATACACTTTGACCTCTACCACGT 
GATTAGGAGAACGATAAACCTCCCAACATACACCCTCGAGGCAGTTTATG 
AGGCAATCTTCGGAAAGCCAAAGGAGAAAGTTTACGCTCACGAGATAGCT 
GAGGCCTGGGAGACTGGAAAGGGACTGGAGAGAGTTGCAAAGTATTCAAT 
GGAGG ATGCAAAGGTAACGTACGAGCTCGGTAGGGAGTTCTTCCCAATGG 
AGGCCCAGCTTtCAAGGTTAGTCGGCCAGCCCCTGTGGGATGTTTCTAGG 
TCTTCAA.CTGGCAACTTGGTGGAGTGGTA.CCTCCTCAGGAAGGCCTACGA 
GAGGAATGAATTGGC7CCAAACAAGCCGGATGAGAGGGAGTACGAGAGAA 
GGCTAAGGGAGAGCTACGCTGGGGGATACGTTAAGGAGCCGGAGAAAGGG 
CTCTGGGAGGGGTTAGTTTCCCTAGATTTCAGGAGCCTGTACCCCTCGA.T 
AAT AAT C AC C C AT AACGTCTC ACCGG AT AC GC TG AA.C AGGG AAG G GTG T A. 
GGG AATACGA.TGTCGCCCCAGAGGTTGGGCA.CA-AGTTCTGCAAGGACTTC 
CCGGGGTTTA.TCCCC AG C C T G C T C A_-.GA.GG 7 7 AT T G G A.T G AAAG G C AAG A. 
AAT AAAAAG G AAG AT G AAAG CTTCT AAAG AC C C AAT C G AG AAG AAG AT G C 
TTG AT T A.C AG G C AAC GGGC AAT C AAAAT C C 7 G G C AAAC A.G C AT T T T AC C G 
G AAG AAT GGGTTCCAC T AATT AAAAAC G G7 AAA.G 77 AA G AT A.T T C C G C A. T 
TGGGGACTTCGTTGATGGACTTATGAAGGCGA.ACCAAGGAAAAGTGAAGA 
AAACGGGGGA7ACAGAAG7777AGAAG77GCAGGAA77CA7GCG7777CC 
TTTGACAGGA.AGTCC AAGAAGGCCCGTGTAATGGCAGTGAAAGCCGTGAT 
AAG AC A.C C G T T A.T T C C GG AAA.T G T TT AT A.G AA T A.G TC T T AAAC T C TG G T A. 
G AAAAAT AAC AAT AAC AG AAG GGCATAGCCTATTTGTCT A.T AG G AAC G G G 
GATCTCGTTGAGGCAACTGGGGAGGA.TGTCAA.AATTGGGGATCTTCTTGC 
AGTTCCAAGATCAGTAAACCTACCAGAGAA-AAGGGAACGCTTGAATATTG 
TTGAACTTCTTCTGAATCTCTCACCGGAAGAGACAGAAGATATAATACTT 
ACGATTCCAGTTAAAGGCAGAAAGAACTTCTTCAAGGGAATGTTGAGAAC 
ATTACGTTGGATTTTTGGTGAGGAAAAGAGAGTAAGGACAGCGAGCCGCT 
ATCTAA.GAC ACCTTGAAAATCTCGGATACATAAGGTTGAGGAAAATTGGA 
TACGACATCATTGATAAGGAGGGGCTTGAGAAATATAGAACGTTGTACGA 
GAAACTTGTTGATGTTGTCCGCTATAATGGCA-ACAAGAGAGAGTATTTAG 
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XTGAATTTAATGCTGTCCGGGACGTTATCTCACTAATGCCAGAGGAAGAA 
CTGAAGGAATGGCGTATTGGAACTAGAAATGGATTCAGAATGGGTACGT*r 
CGTAGATATTGATGAAGATTTTGCCAAGCTTCTTGGCTACTATGTGAGCG 
AGGGAAGTGCGAGGAAGTGGAAGAATCAAACTGGAGGTTGGAGTTACACT 
GTGAGATTGTACAACGAGAACGATGAAGTTCTTGACGACATGGAACACTT 
AGCCAAGAAGTTTTTTGGGAAAGTCAAACGTGGAAAGAACTATGTTGAGA 
TACCAAAGAAAATGGCTTATATCATCTTTGAGAGCCTTTGTGGGACTTTG 
GCAGAAAACAAAAGGGTTCCTGAGGTAATCTTTACCTCATCAAAGGGCGT 
TAGATGGGCCTTCCTTGAGGGTTATTTCATCGGCGATGGCGATGTTCACC 
CAAGCAAGAGGGTTCGCCTATCAACGAAGAGCGAGCTTTTAGTAAATGGC 
CTTGTTCTCCTACTTAACTCCCTTGGAGTATCTGCCATTAAGCTTGGATA 
C G AT AGC GG AG TCT AC AGGGT TTAT G T AAAC G AG G AACTTAAGTTT AC GG 
AATACAGAAAGAAAAAGAATGTATATCACTCTCACATTGTTCCAA^GGAT 
ATTCTC AAAGAAACTTTTGGT AAGGTCTTCC AG AAAAAT AT AAGTT a C AA 
GAAATT T AG AG AGC T T GTAG AAAAT G G AAAAC TT G AC AG GG AG AAAGCC A 
AACGCATTGAGTGGTTACTTAACGGAGATATAGTCCTAGATAGAGTCGTA 
GAG AT T AAG AG AG AG T AC T AT GAT G G 7 T AC G T T T AC G AT CT AAG T G T C G A 
T G AAG AT G AG AATT T C CT TG 
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BESTFIT comparison of the deduced amino acid sequences of 
Pyrococcus DNA Polymerase (top line) to T.litoraUs DNA Polymerase (bottom line, 

BESTFIT of: Pyrococcus .Pep from: 1 to: 1019 

TRANSLATE DNA of: Pyrococcus . seq from: 363 to: 3420 
to: T.litoralis.Pep from: 1 to: 1100 

TRANSLATE DNA of: T . litoralis . seq from: 291 to: 5401 

Percent Similarity: 83.219 Percent Identity: 68.302 

1 MILDADYITEDGKPI IRIFKKENGEFKVEYDRNFRPYIYALLKDDSQIDE 50 

1 1 1 1 • f 1 I f • I II M I I I I I I I I I I I I :l . I . : I . M | | | ( I II I I . I : | 
1 MILDTDYITKDGKP IIRIFKKENGEFKIELDPHFQPYIYALLKDDSAIEE 50 

51 VKKITAERHGKIVRIIDAEKVRKKFLGRP IEVWRLYFEHFQDVPAIRDKI 100 

<i ml:' 1 IMI • l|::M I I II I I Ml.: Ill: l.l 11:1:11 

51 IKAIKGERHGKT^/RVLDAVKVKKKFLGREVEVWKLIFEHPQDVPAMRGKI 100 

101 REHSAVIDIFEYDIPFAKRYLIDKGLIPMEGDEELKLLAFDIETLYH^GE 150 

1 1 1 ■ 1 1 : M : I ' I I I 1 I I I I I I I I I I I I I I I I I I I I I I I | II | | : | | | | : 
101 REHPAWDIYEYDIPFAKRYLIDKGLIPMEGDEELKLLAFD IETFYHEGD 150 

151 EFAXGP IIMISYADEEEAKVITWKKIDLPYVEVVSSEREMIKRFLKVIRE 200 
1 1 : ' I • 1 M I I I I I I I I I : I | I | | . | J | | | | : | | | . ( | | | , | | | . . | , 

151 EFGKGEIIMISYADEEEARVITWKNIDLPYVDWSNEREMIKRFVQWKE 200 

201 KDPDVTITYNGDSFDLPYLVKRAEKLGIKLPLGRDG . . SEPKMQRLGDMT 248 

"MII!IMII.tl||||:|||||||:;|.|||, .|I|:M.|| 
201 KDPDVIITYNGDNFDLPYLIKRAEKLGVRLVLGRDKEHPEPKIQRMGDSF 250 

249 AVEIKGRIHFDLYHVIRRTINLPTYTLEAVYEAIFGKPKEKVYAHEIAEA 298 
1 1 1 M I I I I I I I :. I : I I I | I | | | | | | | | | | | | :: | | ,,. | . | . | | , . 

251 AVEIKGRIHFDLFPWRRTINLPTYTLEAVYEAVLGKTKSKLGAEEIAAI 300 

299 WETGKGLERVAKYSMEDAKVTYELGREFFPMEAQLSRLVGQPLWDVSRSS 348 

I ' ' : * : : * : : I. I I I I I I : . I I I I I : II I I I I i : I . : I : I | . : | | | | | | | 
301 WETEESMKKLAQYSMEDARATYELGKEFFPMEAELAKLIGQSVWDVSRSS 350 

349 TGNLVEWYLLRKAYERNELAPNKPDEREYERRLRESYAGGYVKEPEKGLW 398 
IMMIMIII IMIIIIIIIIMI ||. nil.. I lllllllllill 

351 TGMLVEWYLLRVAYARNELAPNKPDEEEYKRKLRTTYLGGYVKEPEKGLW 400 

399 EGLVSLDFRSLYPS IIITHNVSPDTLNREGCREYDVAPEVGHKFCKDFPG 448 

IMMIIMM:MMIMII::MI::MMI ILMIIIIII 
401 ENI I YLDFRS LYPS 1 1 VTHNVSPDTLEKEGCKNYDVAP I VGYRFCKDFPG 450 

449 FIPSLLKRLLOERQEIKRKMKASKDPIEKKMLDYRQRAIKILANSILPEE 498 

AC 1111:1 M:||:MI.. ! I I I I I M I I I I I I I i : I I I I ( I | : | 

451 FIPS ILGDLIAMRQDIKKKMKSTIDP IEKKMLDYRQRAIKLLANS ILPNE 500" 

499 WVP L I KNGKVKI F RI GD F VD GLMKANQGKVK KTGDT E VLE VAG I HAF S FD 548 

, ft , l : ":|.||.:l:.:||:|:::.|. . . : . I I . . : : I II M I . . : (III: 

501 WLPIIENGEIKFVKIGEFINSYMEKQKENVKTVENTEVLEVNNLFAFSFN S50 
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549 RKSKKARVMAVKAVIRHRYSGNVYRIVLNSGRKITITEGHSLFVYRNGDL 5 98 

: 1 I • • I . It I : I I I : I . I . . I I I . I t I I I . I I . I I I ! I . Ml:: 
551 KKIKESEVKKVKALIRHKYKGKAYEIQLSSGRKINITAGHSLrTVRNGEI 600 
• 

599 VEATGEDVKIGDLLAVPRSVNLPEKRERLNIVELLLNLSPEETEDIILTI 648 

I . - I : : : I I M : . . I : . : . I II . : I I . I I : : I I . i I I . I I : : | | 
601 KEVSGDGIKEGDLIVAP KKIKLNEKGVSINIPELISDLSEEETADIVMTI 650 

649 PVKGRKNFFKGMLRTLRWIFGEE . KRVRTASRYLRHLENLGYIRLRKIGY 697 

• • M I M I I It M I I I ! I : I I I I : I : I I .III I t I . I I . I : I . It 
651 SAKGRKNFFKGMLRTLRWMFGEENRRIRTFNRYLFHLEKLGLIKLLPRGY 700 

6 98 D I IDKEGLEKYRTLYEKLVDVVRYNGNKRE YLVEFNAVRD VI S LMPEEEL 7 47 

I - I I : Mill.: 1:1111111111 I I . : : 1 . 11 . : I : . I I 
701 EVTDWERLKKYKQLYEKLAGSVKYNGNKREYLVMFNEIKDFISYFPQKEL 7 50 

7 48 KEWRIGTRNGFRMGTFVDIDEDFAKLLGYYVSEGSARKWKNQTGGWSYTV 7 97 

.11:111 I I I I I I II : I I ! I I M I 1 I I 11.111 M.I 

7 51 EEWKIGTLNGFRTNCILKVDEDFGKLLGYYVSEGYAGAQKNKTGGISYSV 8 00 

7 98 RLYNENDEVLDDMEHLAKKFFGKVKRGKNYVEIPKKMAYI IFESLCGTLA 8 47 

M M I : . : I I : M . : : I M 11 M I : : M : I M M I I M : : : . : i I I M I 
801 KLYNEDPNVLESMKNVAEKFFGKVRVDRNCVSISKKMAYLVMKCLCGALA 8 50 

8 48 ENKRVPEVIFTSSKGVRWAFLEGYFIGDGDVHPSKRVRLSTKSELLVNGL 8 97 

I I I I : I • I i : I I . . . | | | . I | j : II . II i I : I M I I . I I I I I 1 I M . I . I 
851 ENKRIPSVILTSPEPVRWSFLEAYFTGDGDIHPSKRFRLSTKSELLANQL 900 

8 98 VLLLNSLGVSAIKLGYDSGVYRVYVNEELKFTEYRKKKNVYHSHIVPKDI 947 

I M II I II : 1 . : I : I : I II It I M : M : 1 . I . : . : . I I . I . I : : : I I : | 
901 VFLLNSLGISSVKIGFDSGVYRVYINEDLQFPQTSREKNTYYSNLIPKEI 950 

948 LKETFGKVFQKNISYKKFRELVENGKLDREKAKRIEWLLNGDIVLDRWE 997 

I : : M I I II II : . M I t : II I : M M : II M I : I : : M M II I II I . 
951 LRDVFGKEFQKNMTFKKFKELVDSGKLNREKAKLLEFFINGDIVLDRVKS 1000 

998 IKREYYDGYVYDLSVDEDENFL 1019 
M . I : II I 11 I II : : : M II 
1001 VKEKDYEGYVYDLSVEDNENFL 1022 
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